
Two Ways to Advantage of Claude Sonnet 4's 1M Context Window in Cline
Claude Sonnet 4 now supports 1 million tokens of context – a 5x increase from the previous 200K limit.

Claude Sonnet 4 now supports 1 million tokens of context – a 5x increase from the previous 200K limit. This isn't just a bigger number; it fundamentally changes how you should approach complex development tasks in Cline.
Stop being context-stingy
With 200K tokens, you had to be strategic about what context to include. Should you load that documentation? Include those test files? Pull in data from your MCP servers? These decisions created friction and often meant working with incomplete information.
With 1M tokens, that calculus changes. Load everything that's relevant. Pull in comprehensive documentation, include related files, and let your MCP servers provide rich context without worrying about hitting limits mid-conversation.
Two new ways to use Cline w/ 1M Sonnet 4
(1) deeper planning sessions
Use Plan mode differently now. Instead of quick planning sessions followed by context-limited execution, you can have comprehensive planning conversations. Load up all your project context, discuss architecture decisions thoroughly, explore edge cases, and refine your approach – all before switching to Act mode with that full context intact.
(2) extended development cycles
Complex features often require multiple iterations: write code, test it, debug issues, refactor, test again. Previously, these cycles would eventually hit context limits, forcing you to start fresh and lose valuable project awareness.
Now these cycles can run much longer while maintaining full context. Cline can remember the entire development journey, making better decisions based on what's been tried, what failed, and why certain approaches were chosen.
(It's more expensive above 200k tokens)
This expanded context comes with higher pricing for prompts over 200K tokens:
Input | Output | |
---|---|---|
Prompts ≤ 200K | $3 / MTok | $15 / MTok |
Prompts > 200K | $6 / MTok | $22.50 / MTok |
You'll also need Tier 4 access on Anthropic's API, though you can access it directly through Cline and OpenRouter.
Note that Anthropic also offers prompt caching and batch processing options that can help reduce costs for large context usage.
Rethinking your approach
The key shift is moving from context scarcity to context abundance. Instead of asking "What's the minimum context I need?" ask "What's all the relevant context that could help?"

This isn't about loading your entire codebase into every conversation. It's about not having to make artificial trade-offs between thoroughness and context limits. When you're working on a complex feature that touches multiple systems, you can now include all the relevant pieces without compromise.
The 1M context window doesn't just give you more space – it gives you permission to work more naturally, without the constant mental overhead of context management getting in the way of solving actual problems.