Deep Research With Trusted Sources

Deep research with AI tools is genuinely useful. It’s also genuinely prone to producing confident-sounding garbage — wrong facts, hallucinated citations, outdated information stated as current, real sources with fabricated quotes. The failure mode is dangerous precisely because it’s invisible at first glance. A well-formatted AI research summary looks authoritative even when it’s completely wrong.

The solution isn’t to avoid AI research tools. It’s to build a workflow that uses them for what they’re good at — synthesis, structure, initial orientation — while keeping humans responsible for source verification. Here’s what that workflow looks like in practice.

The Source-First Principle

The most important rule in AI-assisted research: never let the AI choose your sources. This sounds counterintuitive when you’re using a tool specifically designed to synthesize information. But source selection is where AI research fails most often and most consequentially.

Models trained on web data reflect the distribution of web content — which means authoritative academic sources are underrepresented relative to blog posts, forums, and SEO-optimized aggregator content. When you let an AI synthesize from its training data, you’re synthesizing from that same distribution. When you provide verified primary sources and ask the AI to synthesize those, you’re synthesizing from a much higher-quality information base.

The workflow reframes AI as a synthesis engine, not a research engine. You find and verify sources. The AI processes and structures them. That division of labor produces reliable outputs.

The Four-Phase Workflow

Phase 1: Orientation (AI-assisted)

Start with a broad orientation prompt to a capable model: “What are the main schools of thought on [topic]? What are the key debates? Who are the primary researchers? What are the landmark papers or publications?” Use this output as a map, not as a fact base. You’re identifying what to look for, not establishing what’s true.

The orientation phase is where AI delivers most of its research value. It collapses the “I don’t know what I don’t know” phase from days to minutes. A researcher new to a domain can get a working map of the intellectual landscape in a single conversation. Treat the map as provisional — it will have errors — and use it to direct your next phase.

Phase 2: Source Collection (Human-led)

Take the names, papers, publications, and debates from your orientation and verify them through primary sources. Google Scholar, Semantic Scholar, PubMed, institutional repositories, and official publications are your tools here. For industry research: primary company publications, regulatory filings, and original technical papers rather than press coverage of those papers.

Verify every source the AI mentioned. Don’t assume cited papers exist. Don’t assume quoted statistics appear in cited documents. Check. The verification step takes time and is the entire point — it’s what produces research you can actually rely on and share.

Build a source library during this phase. Store verified documents with clear provenance: author, publication, date, URL or DOI, and your confidence in the source’s reliability. This library becomes the AI’s input in the next phase.

Phase 3: Deep Synthesis (AI-assisted)

Now bring in your verified sources. Upload documents or paste key sections directly into the context. The prompt structure that works: “Based on these documents [insert documents], what does the evidence show about [specific question]? Where do sources agree? Where do they conflict? What are the strongest arguments on each side?”

Using tools like Claude’s project files or a context-window-aware model means you can work with 10-20 substantial documents in a single session. The AI performs genuine synthesis — finding patterns across documents, identifying contradictions, tracing how thinking evolved over time — at a speed no human researcher can match.

The key instruction to include: “Cite the specific document and section for every claim you make.” This makes verification of the synthesis output tractable. Any claim you want to use can be traced back to a specific source you’ve already verified.

Phase 4: Verification and Gap-Filling (Human-led)

Read the synthesis critically. For every significant claim, trace it to the source document. For every source citation, check that the cited document actually supports the claim as stated — not a paraphrased version of what the document says, but the actual claim.

Identify gaps: what questions remain unanswered by your source library? These are either research questions worth pursuing or places where your research needs to acknowledge uncertainty rather than assert conclusions. The gaps are as important as the findings.

The Tools That Work

For orientation: any capable frontier model with a large context window. Claude, GPT-4o, and Gemini all work. Focus on models that hedge well — that say “I’m uncertain about this” rather than making up confident answers.

For document synthesis: models with large context windows are essential. Claude 3 Opus and Claude Sonnet handle 100,000+ tokens of document context effectively. Gemini 1.5 Pro handles even larger document sets. For very large source libraries, a RAG (retrieval-augmented generation) setup — where the AI retrieves relevant passages from indexed documents rather than holding everything in context — is more reliable than stuffing everything into one context window.

For source discovery: Perplexity’s academic mode with citations enabled, Consensus for scientific claims, and Elicit for empirical research questions. These tools provide AI synthesis with primary source links — a middle ground between pure model synthesis and human-led source collection. Still verify. But the verification burden is lower because sources are linked rather than fabricated.

What This Workflow Produces

Research that you can actually stake something on. Not a long document that looks authoritative. Research where you can point to every significant claim and show exactly where it comes from and why you believe the source.

The AI-assisted workflow produces that output faster than traditional research methods — meaningfully faster, not incrementally faster. The orientation phase alone saves days for researchers entering unfamiliar domains. The synthesis phase processes document sets that would take a human researcher a week to read in a few hours.

But the speed only matters if the output is reliable. That reliability comes from the source-first discipline — from treating the AI as a synthesis engine that processes your verified inputs, not an oracle that generates facts from training data. The workflow is the thing. The tools are secondary.