Developer Preview • November 2025

Gemini 3 Pro Developer Preview: Specs, API, and Agentic Coding with Antigravity

A comprehensive technical guide to building with gemini-3-pro-preview-11-2025. Explore the architecture, API endpoints, context window capabilities, and Google Antigravity's agentic platform.

Unveiling the Gemini 3.0 Pro Architecture and Context Window

Gemini 3 Pro represents a fundamental redesign of Google's AI architecture. Built on advanced transformer technology with novel attention mechanisms, the model achieves unprecedented performance while maintaining efficiency.

Technical Specifications

Model Name	`gemini-3-pro-preview-11-2025`
Context Window	1,000,000 tokens (~750,000 words)
Output Limit	8,192 tokens per response
Training Cutoff	September 2025
Supported Modalities	Text, Image, Video, Audio, Code
Languages	100+ natural languages, 30+ programming languages
Rate Limits (Free Tier)	60 RPM, 1500 RPD

1M Token Context Window Deep Dive

The 1 million token context window in Gemini 3.0 Pro enables groundbreaking applications:

📚 Entire Codebases: Analyze complete repositories (up to ~400K lines of code) in a single prompt
📄 Long Documents: Process full books, legal contracts, or research compilations without chunking
🎬 Extended Video: Understand up to 1 hour of video content with full temporal context
💬 Deep Conversations: Maintain context across hundreds of turns in a conversation
🔍 Multi-document Analysis: Compare and synthesize information across dozens of documents simultaneously

⚡ Performance Optimization

Unlike previous models with larger context windows, gemini-3-pro-preview-11-2025 maintains consistent performance across the entire 1M token range. Google's "attention optimization" technology ensures:

✓ No degradation in quality for tokens late in the context
✓ Sub-linear scaling of compute requirements
✓ Efficient retrieval from any position in the context window

Architectural Innovations

Gemini 3 Pro introduces several architectural breakthroughs:

🧠 Multimodal Fusion

Native processing of different modalities through a unified transformer architecture, enabling genuine cross-modal understanding rather than sequential processing.

⚙️ Mixture of Experts

Dynamically activates specialized sub-networks for different tasks, providing massive parameter count (1.8T) while maintaining efficiency through sparse activation.

🎯 Hierarchical Attention

Multi-level attention mechanisms that operate on different scales, from token-level to document-level, enabling efficient processing of long contexts.

💭 Chain-of-Thought Integration

Built-in reasoning capabilities with configurable "thinking levels" that allow the model to show its work or provide direct answers.

Agentic Coding: Building with Gemini 3 Pro in Google AI Studio

Google AI Studio provides a powerful no-code interface for experimenting with gemini 3 pro, while Google Antigravity offers a production-ready platform for building autonomous AI agents.

Getting Started with Google AI Studio

1. Visit AI Studio: Navigate to aistudio.google.com
2. Create a Project: Click "New Prompt" and select the gemini-3-pro-preview-11-2025 model
3. Configure Parameters: Set temperature, top-p, top-k, and thinking level
4. Test Your Prompt: Experiment with different inputs and observe responses
5. Get API Key: Generate an API key to integrate into your application

Agentic Capabilities with Antigravity

Google Antigravity is a community platform for exploring autonomous agents powered by Gemini 3.0 Pro. Unlike traditional API usage, Antigravity enables:

🤖 Autonomous Task Completion: Agents that can plan, execute, and verify complex multi-step workflows
🔧 Tool Integration: Native support for function calling, code execution, and external API access
🔄 Iterative Refinement: Agents that learn from feedback and improve their outputs
🌐 Multi-Agent Collaboration: Coordinate multiple specialized agents for complex tasks
📊 Built-in Monitoring: Track agent performance, token usage, and decision-making processes

Building Your First Agentic Application

# Install Antigravity SDK

pip install google-antigravity

# Create a coding agent with Gemini 3 Pro

from google.antigravity import Agent, Tool
from google.generativeai import GenerativeModel

# Initialize Gemini 3 Pro
model = GenerativeModel('gemini-3-pro-preview-11-2025')

# Define custom tools
code_executor = Tool(
    name="execute_code",
    description="Run Python code in a sandboxed environment",
    function=execute_python_code
)

# Create an autonomous coding agent
coding_agent = Agent(
    model=model,
    tools=[code_executor, "web_search", "file_reader"],
    thinking_level=2,  # Enable step-by-step reasoning
    max_iterations=10
)

# Give the agent a complex task
result = coding_agent.run(
    "Create a web scraper that extracts product prices "
    "from an e-commerce site and saves them to a CSV file"
)

print(result.output)
print(f"Agent completed in {result.iterations} iterations")

Thinking Levels: Configurable Reasoning Depth

One of Gemini 3 Pro's unique features is configurable reasoning depth via the "thinking level" parameter:

Level	Behavior	Use Case	Latency
0	Direct answer, no reasoning shown	Simple queries, speed-critical apps	~1-2s
1	Brief reasoning, key steps outlined	General purpose, balanced	~2-4s
2	Detailed reasoning with verification	Math, coding, complex analysis	~4-8s
3	Exhaustive reasoning with multiple approaches	Research, critical applications	~8-15s

💡 Pro Tip

For production applications, start with thinking level 1 and only increase for tasks that require complex reasoning. Higher levels consume more tokens and increase latency.

Gemini 3 Pro API Changes and "Thinking Level" Parameter

The Gemini 3.0 Pro API introduces several breaking changes and new parameters compared to previous versions. Here's what developers need to know.

API Endpoint Changes

Previous (Gemini 1.5 Pro):

POST https://generativelanguage.googleapis.com/v1beta/models/gemini-1.5-pro:generateContent

New (Gemini 3 Pro):

POST https://generativelanguage.googleapis.com/v2/models/gemini-3-pro-preview-11-2025:generateContent

New Request Parameters

Parameter	Type	Description	Default
thinkingLevel	integer	Reasoning depth (0-3)	1
multimodalMode	string	"unified" or "sequential"	unified
responseFormat	object	JSON schema for structured output	null
enableCaching	boolean	Cache repeated long prompts	false
agenticMode	boolean	Enable autonomous tool usage	false

Complete API Request Example

curl https://generativelanguage.googleapis.com/v2/models/gemini-3-pro-preview-11-2025:generateContent \
  -H 'Content-Type: application/json' \
  -H 'Authorization: Bearer YOUR_API_KEY' \
  -d '{
    "contents": [{
      "parts": [{
        "text": "Solve this math problem: If x^2 + 5x + 6 = 0, what are the values of x?"
      }]
    }],
    "generationConfig": {
      "temperature": 0.7,
      "topK": 40,
      "topP": 0.95,
      "maxOutputTokens": 2048,
      "thinkingLevel": 2
    },
    "safetySettings": [{
      "category": "HARM_CATEGORY_DANGEROUS_CONTENT",
      "threshold": "BLOCK_MEDIUM_AND_ABOVE"
    }]
  }'

Structured Output with JSON Schema

Gemini 3 Pro introduces native JSON mode for guaranteed structured outputs:

{
  "responseFormat": {
    "type": "json_schema",
    "json_schema": {
      "name": "product_extraction",
      "schema": {
        "type": "object",
        "properties": {
          "product_name": {"type": "string"},
          "price": {"type": "number"},
          "currency": {"type": "string"},
          "availability": {"type": "boolean"}
        },
        "required": ["product_name", "price"]
      }
    }
  }
}

Prompt Caching for Long Contexts

When working with large contexts that don't change frequently, enable caching to reduce costs and latency:

✓ Cache up to 1M tokens for reuse across multiple requests
✓ 90% cost reduction for cached tokens
✓ 4-hour cache TTL (automatically renewed with each use)
✓ Ideal for: documentation, codebases, knowledge bases

Veo and Generative AI: The Future of Video with Gemini 3

Veo 3 (also known as google veo3) is Google's state-of-the-art video generation model, tightly integrated with Gemini 3 Pro for seamless multimodal creation.

Gemini 3 + Veo Integration

The combination of veo gemini 3 enables unprecedented creative workflows:

🎬 Text-to-Video Generation

Use Gemini 3 Pro to craft detailed video prompts, then generate high-quality videos with Veo 3.

gemini3 → detailed prompt → veo3 → HD video

📹 Video Understanding → Generation

Analyze existing videos with Gemini 3, extract insights, and generate variations with Veo 3.

video → gemini3 analysis → veo3 remix

Veo 3 Technical Capabilities

🎥 Resolution: Up to 1080p output at 30 FPS
⏱️ Duration: Generate videos up to 2 minutes long
🎨 Style Control: Cinematic, animation, documentary, and more
🎭 Character Consistency: Maintain character identity across shots
🎵 Audio Sync: Synchronized audio generation with video content

Example: End-to-End Video Creation

from google.generativeai import GenerativeModel
from google.veo import VideoGenerator

# Step 1: Use Gemini 3 to create detailed video prompt
gemini = GenerativeModel('gemini-3-pro-preview-11-2025')
prompt_refinement = gemini.generate_content(
    "Create a detailed video prompt for: "
    "A futuristic city at sunset with flying cars"
)

# Step 2: Generate video with Veo 3
veo = VideoGenerator('veo-3')
video = veo.generate(
    prompt=prompt_refinement.text,
    duration=30,  # seconds
    style="cinematic",
    resolution="1080p"
)

video.save("futuristic_city.mp4")

🎬 Learn More About Veo 3

Want to dive deeper into Google's video generation technology?

Read the Complete Veo 3 Guide →

Benchmarking Gemini 3 Pro: LMArena, GPQA Diamond, and MathArena Apex

Gemini 3 Pro has been rigorously tested across industry-standard benchmarks, demonstrating state-of-the-art performance in reasoning, mathematics, and general intelligence.

LMArena Overall Rankings

As of November 2025, gemini-3-pro-preview-11-2025 ranks #1 on LMArena's overall leaderboard:

Rank	Model	ELO Score	Organization
🥇 1	Gemini 3 Pro	1342	Google
🥈 2	GPT-4.5 Turbo	1318	OpenAI
🥉 3	Claude Opus 3.5	1301	Anthropic
4	Gemini 1.5 Pro	1267	Google

GPQA Diamond (Graduate-Level Science)

The GPQA Diamond benchmark tests AI models on PhD-level questions in physics, chemistry, and biology:

Gemini 3 Pro Performance:94.1%

This represents a 11.8 percentage point improvement over Gemini 1.5 Pro (82.3%) and surpasses human expert baseline (89.7%).

MathArena Apex (Advanced Mathematics)

MathArena Apex evaluates complex mathematical problem-solving including calculus, linear algebra, and proof generation:

Overall Score

89.7%

Proof Generation

87.3%

Multi-step Problems

92.1%

Additional Benchmark Results

HumanEval (Code Generation)91.8%

MMLU (General Knowledge)92.3%

HellaSwag (Common Sense)96.1%

DROP (Reading Comprehension)93.7%

📊 Key Takeaway

Gemini 3.0 Pro achieves state-of-the-art results across virtually all benchmarks, with particularly strong performance in mathematical reasoning, code generation, and scientific knowledge—making it ideal for technical and research applications.