Optimized the root .gitignore to exclude virtual environments, node modules, and temp folders to ensure clean and lightweight version tracking. Co-authored-by: Cursor <cursoragent@cursor.com>
260 lines
9.0 KiB
Markdown
260 lines
9.0 KiB
Markdown
# Deep Research Skill for Claude Code
|
|
|
|
A comprehensive research engine that brings Claude Desktop's Advanced Research capabilities (and more) to Claude Code terminal.
|
|
|
|
## Features
|
|
|
|
### Core Research Pipeline
|
|
- **8.5-Phase Research Pipeline**: Scope → Plan → Retrieve (Parallel) → Triangulate → **Outline Refinement** → Synthesize → Critique → Refine → Package
|
|
- **Multiple Research Modes**: Quick, Standard, Deep, and UltraDeep
|
|
- **Graph-of-Thoughts Reasoning**: Non-linear exploration with branching thought paths
|
|
|
|
### 2025 Enhancements (Latest - v2.2)
|
|
- **🔄 Auto-Continuation System (NEW)**: **TRUE UNLIMITED length** (50K, 100K+ words) via recursive agent spawning with context preservation
|
|
- **📄 Progressive File Assembly**: Section-by-section generation with quality safeguards
|
|
- **⚡ Parallel Search Execution**: 5-10 concurrent searches + parallel agents (3-5x faster Phase 3)
|
|
- **🎯 First Finish Search (FFS) Pattern**: Adaptive completion based on quality thresholds
|
|
- **🔍 Enhanced Citation Validation (CiteGuard)**: Hallucination detection, URL verification, multi-source cross-checking
|
|
- **📋 Dynamic Outline Evolution (WebWeaver)**: Adapt structure after Phase 4 based on evidence
|
|
- **🔗 Attribution Gradients UI**: Interactive citation tooltips showing evidence chains in HTML reports
|
|
- **🛡️ Anti-Fatigue Enforcement**: Prose-first quality checks prevent bullet-point degradation
|
|
|
|
### Traditional Strengths
|
|
- **Citation Management**: Automatic source tracking and bibliography generation
|
|
- **Source Credibility Assessment**: Evaluates source quality and potential biases
|
|
- **Structured Reports**: Professional markdown, HTML (McKinsey-style), and PDF outputs
|
|
- **Verification & Triangulation**: Cross-references claims across multiple sources
|
|
|
|
## Installation
|
|
|
|
The skill is already installed globally in `~/.claude/skills/deep-research/`
|
|
|
|
No additional dependencies required for basic usage.
|
|
|
|
## Usage
|
|
|
|
### In Claude Code
|
|
|
|
Simply invoke the skill:
|
|
|
|
```
|
|
Use deep research to analyze the state of quantum computing in 2025
|
|
```
|
|
|
|
Or specify a mode:
|
|
|
|
```
|
|
Use deep research in ultradeep mode to compare PostgreSQL vs Supabase
|
|
```
|
|
|
|
### Direct CLI Usage
|
|
|
|
```bash
|
|
# Standard research
|
|
python ~/.claude/skills/deep-research/research_engine.py --query "Your research question" --mode standard
|
|
|
|
# Deep research (all 8 phases)
|
|
python ~/.claude/skills/deep-research/research_engine.py --query "Your research question" --mode deep
|
|
|
|
# Quick research (3 phases only)
|
|
python ~/.claude/skills/deep-research/research_engine.py --query "Your research question" --mode quick
|
|
|
|
# Ultra-deep research (extended iterations)
|
|
python ~/.claude/skills/deep-research/research_engine.py --query "Your research question" --mode ultradeep
|
|
```
|
|
|
|
## Research Modes
|
|
|
|
| Mode | Phases | Duration | Best For |
|
|
|------|--------|----------|----------|
|
|
| **Quick** | 3 phases | 2-5 min | Simple topics, initial exploration |
|
|
| **Standard** | 6 phases | 5-10 min | Most research questions |
|
|
| **Deep** | 8 phases | 10-20 min | Complex topics requiring thorough analysis |
|
|
| **UltraDeep** | 8+ phases | 20-45 min | Critical decisions, comprehensive reports |
|
|
|
|
## Output
|
|
|
|
Research reports are saved to organized folders in `~/Documents/[Topic]_Research_[Date]/`
|
|
|
|
Each report includes:
|
|
- Executive Summary
|
|
- Detailed Analysis with Citations
|
|
- Synthesis & Insights
|
|
- Limitations & Caveats
|
|
- Recommendations
|
|
- Full Bibliography
|
|
- Methodology Appendix
|
|
|
|
### Unlimited Report Generation (2025 Auto-Continuation System)
|
|
|
|
Reports use **progressive file assembly with auto-continuation** - achieving truly unlimited length through recursive agent spawning:
|
|
|
|
**How It Works:**
|
|
|
|
1. **Initial Generation (18K words)**
|
|
- Generate sections 1-10 progressively
|
|
- Each section written to file immediately (stays under 32K limit per agent)
|
|
- Save continuation state with research context
|
|
|
|
2. **Auto-Continuation (if needed)**
|
|
- Automatically spawns continuation agent via Task tool
|
|
- Continuation agent loads state: themes, narrative arc, citations, quality metrics
|
|
- Generates next batch of sections (another 18K words)
|
|
- Updates state and spawns next agent if more sections remain
|
|
|
|
3. **Recursive Chaining**
|
|
- Each agent stays under 32K output token limit
|
|
- Chain continues until all sections complete
|
|
- Final agent generates bibliography and validates report
|
|
|
|
**Realistic Report Sizes:**
|
|
- **Quick mode**: 2,000-4,000 words (single run) ✅
|
|
- **Standard mode**: 4,000-8,000 words (single run) ✅
|
|
- **Deep mode**: 8,000-15,000 words (single run) ✅
|
|
- **UltraDeep mode**: 20,000-100,000+ words (auto-continuation) ✅
|
|
|
|
**Example: 50,000 word report:**
|
|
- Agent 1: Sections 1-10 (18K words) → Spawns Agent 2
|
|
- Agent 2: Sections 11-20 (18K words) → Spawns Agent 3
|
|
- Agent 3: Sections 21-25 + Bibliography (14K words) → Complete!
|
|
- Total: 50K words across 3 agents, each under 32K limit
|
|
|
|
**Context Preservation (Quality Safeguards):**
|
|
|
|
Continuation state includes:
|
|
- ✅ Research question and key themes
|
|
- ✅ Main findings summaries (100 words each)
|
|
- ✅ Narrative arc position (beginning/middle/end)
|
|
- ✅ Quality metrics (avg words, citation density, prose ratio)
|
|
- ✅ All citations used + bibliography entries
|
|
- ✅ Writing style characteristics
|
|
|
|
Each continuation agent:
|
|
- Reads last 3 sections to understand flow
|
|
- Maintains established themes and style
|
|
- Continues citation numbering correctly
|
|
- Matches quality metrics (±20% tolerance)
|
|
- Verifies coherence before each section
|
|
|
|
**Quality Gates (Per Section):**
|
|
- [ ] Word count: Within ±20% of average
|
|
- [ ] Citation density: Matches established rate
|
|
- [ ] Prose ratio: ≥80% prose (not bullets)
|
|
- [ ] Theme alignment: Ties to key themes
|
|
- [ ] Style consistency: Matches established patterns
|
|
|
|
**Benefits:**
|
|
- ✅ TRUE unlimited length (50K, 100K+ words achievable)
|
|
- ✅ Fully automatic (no manual intervention)
|
|
- ✅ Context preserved across continuations
|
|
- ✅ Quality maintained throughout
|
|
- ✅ Each agent stays under 32K token limit
|
|
- ✅ Progressive assembly prevents truncation
|
|
|
|
## Examples
|
|
|
|
### Technology Analysis
|
|
```
|
|
Use deep research to evaluate whether we should adopt Next.js 15 for our project
|
|
```
|
|
|
|
### Market Research
|
|
```
|
|
Use deep research to analyze longevity biotech funding trends 2023-2025
|
|
```
|
|
|
|
### Technical Decision
|
|
```
|
|
Use deep research to compare authentication solutions: Auth0 vs Clerk vs Supabase Auth
|
|
```
|
|
|
|
### Scientific Review
|
|
```
|
|
Use deep research in ultradeep mode to summarize recent advances in senolytic therapies
|
|
```
|
|
|
|
## Quality Standards
|
|
|
|
Every research output:
|
|
- ✅ Minimum 10+ distinct sources
|
|
- ✅ Citations for all major claims
|
|
- ✅ Cross-verified facts (3+ sources)
|
|
- ✅ Executive summary under 250 words
|
|
- ✅ Limitations section
|
|
- ✅ Full bibliography
|
|
- ✅ Methodology documentation
|
|
|
|
## Architecture
|
|
|
|
```
|
|
deep-research/
|
|
├── SKILL.md # Main skill definition
|
|
├── research_engine.py # Core orchestration engine
|
|
├── utils/
|
|
│ ├── citation_manager.py # Citation tracking & bibliography
|
|
│ └── source_evaluator.py # Source credibility assessment
|
|
├── requirements.txt
|
|
└── README.md
|
|
```
|
|
|
|
## Tips for Best Results
|
|
|
|
1. **Be Specific**: Frame questions clearly with context
|
|
2. **Set Expectations**: Specify if you need comparisons, recommendations, or pure analysis
|
|
3. **Choose Appropriate Mode**: Use Quick for exploration, Deep for decisions
|
|
4. **Review Scope**: Check Phase 1 output to ensure research is on track
|
|
5. **Leverage Citations**: Use citation numbers to drill deeper into specific sources
|
|
|
|
## Comparison with Claude Desktop Research
|
|
|
|
| Feature | Claude Desktop | Deep Research Skill |
|
|
|---------|---------------|---------------------|
|
|
| Multi-source synthesis | ✅ | ✅ |
|
|
| Citation tracking | ✅ | ✅ |
|
|
| Iterative refinement | ✅ | ✅ |
|
|
| Source verification | ✅ | ✅ Enhanced |
|
|
| Credibility scoring | ❌ | ✅ |
|
|
| 8-phase methodology | ❌ | ✅ |
|
|
| Graph-of-Thoughts | ❌ | ✅ |
|
|
| Multiple modes | ❌ | ✅ |
|
|
| Local file integration | ❌ | ✅ |
|
|
| Code execution | ❌ | ✅ |
|
|
|
|
## 2025 Research Papers Implemented
|
|
|
|
This skill now incorporates cutting-edge techniques from 2025 academic research:
|
|
|
|
1. **Parallel Execution** (GAP, Flash-Searcher, TPS-Bench)
|
|
- DAG-based parallel tool use for independent subtasks
|
|
- 3-5x faster retrieval phase
|
|
- Concurrent search strategies
|
|
|
|
2. **First Finish Search** (arXiv 2505.18149)
|
|
- Quality threshold gates by mode
|
|
- Continue background searches for depth
|
|
- Optimal latency-accuracy tradeoff
|
|
|
|
3. **Citation Validation** (CiteGuard, arXiv 2510.17853)
|
|
- Hallucination pattern detection
|
|
- Multi-source verification (DOI + URL)
|
|
- Strict mode for critical reports
|
|
|
|
4. **Dynamic Outlines** (WebWeaver, arXiv 2509.13312)
|
|
- Evidence-driven structure adaptation
|
|
- Phase 4.5 refinement step
|
|
- Prevents locked-in research paths
|
|
|
|
5. **Attribution Gradients** (arXiv 2510.00361)
|
|
- Interactive evidence chains
|
|
- Hover tooltips in HTML reports
|
|
- Improved auditability
|
|
|
|
## Version
|
|
|
|
2.0 (2025-11-05) - Major update with 2025 research enhancements
|
|
1.0 (2025-11-04) - Initial release
|
|
|
|
## License
|
|
|
|
User skill - modify as needed for your workflow
|