A diagram illustrating a tree search framework outperforming vector search, achieving 98.7% success. [alttext.ai](https://alt

Editorial illustration for Tree search framework achieves 98.7% success on docs where vector search fails

Tree Search Beats Vector Retrieval with 98.7% Success

Tree search framework achieves 98.7% success on docs where vector search fails

January 30, 2026 • 2 min read

Vector‑based retrieval has become the default for most enterprise search tools, but its strength is also its weakness: it leans on dense embeddings that capture a document’s overall theme, not the fine‑grained steps a user may need to follow. When a question requires chaining several pieces of evidence—think “Which regulation changed after the 2022 privacy amendment and how does it affect data‑retention policies?”—the embeddings often collapse the nuance into a single point, and the system returns irrelevant hits. Researchers have been probing alternatives, and a new tree‑search framework now reports a 98.7 % success rate on the subset of queries where traditional vector search falls short.

The numbers suggest a structural shift: instead of relying on a holistic similarity score, the method breaks the problem into bite‑size, context‑light prompts that can be recombined. That change matters because it directly tackles the “multi‑hop” reasoning gap that has long hampered AI‑assisted research and compliance work.

---

The system matches documents against a short, decontextualized query rather than the full history of the problem the user is trying to solve. Solving the multi‑hop reasoning problem…

The system matches documents against a short, decontextualized query rather than the full history of the problem the user is trying to solve. Solving the multi-hop reasoning problem The real-world impact of this structural approach is most visible in "multi-hop" queries that require the AI to follow a trail of breadcrumbs across different parts of a document. In a recent benchmark test known as FinanceBench, a system built on PageIndex called "Mafin 2.5" achieved a state-of-the-art accuracy score of 98.7%. The performance gap between this approach and vector-based systems becomes clear when analyzing how they handle internal references.

This tree search framework hits 98.7% on documents where vector search fails - VentureBeat AI

PageIndex delivers a striking 98.7 % success rate on documents that stump traditional vector search. The gain is notable. Yet the numbers come from a narrow set of failure cases, leaving open whether the same gain appears across broader corpora.

By discarding the chunk‑and‑embed pipeline, the framework re‑imagines retrieval as a navigational tree walk, aligning short, decontextualized queries with document structure rather than semantic similarity scores. This shift appears to unlock multi‑hop reasoning, where an answer depends on traversing several linked passages. The open‑source nature of the project invites community scrutiny, but the article does not detail runtime costs or how the approach scales to collections beyond the tested subset.

Consequently, while the reported accuracy is impressive, it remains unclear whether the method can replace vector databases in production environments or if it will coexist as a complementary tool for particularly long texts. Future benchmarks that compare end‑to‑end latency and resource usage will be needed to assess the practical trade‑offs of this tree‑search paradigm.

Common Questions Answered

How does Mafin 2.5 improve upon traditional vector-based RAG for financial document analysis?

Mafin 2.5 uses the PageIndex framework to preserve document structure and reasoning, moving beyond simple vector similarity. Instead of chunking documents arbitrarily, it maintains the hierarchical relationships between sections, tables, and footnotes, allowing for more precise and contextually aware document retrieval.

What are the key limitations of traditional vector-based RAG that PageIndex aims to solve?

Traditional vector RAG struggles with loss of document structure, semantic ambiguity, and opaque retrieval processes. The PageIndex framework addresses these issues by treating documents as structured trees, enabling dynamic navigation and reasoning that mimics how a human expert would analyze complex financial documents.

What benchmark performance did Mafin 2.5 achieve using the PageIndex framework?

Mafin 2.5 achieved an impressive 98.7% accuracy on the FinanceBench benchmark, significantly outperforming traditional vector-based RAG systems. This breakthrough demonstrates the potential of reasoning-driven search approaches in extracting precise insights from complex financial documents like SEC filings and earnings reports.

🎓

Featured Review

No Code MBA

Build AI apps without coding. Our in-depth course review.

Read Review

Tree Search Beats Vector Retrieval with 98.7% Success

Further Reading

Common Questions Answered

How does Mafin 2.5 improve upon traditional vector-based RAG for financial document analysis?

What are the key limitations of traditional vector-based RAG that PageIndex aims to solve?

What benchmark performance did Mafin 2.5 achieve using the PageIndex framework?

Most Popular

Dfinity's Caffeine AI Builds Apps Through Conversation

Mistral AI launches Forge to let firms build proprietary AI models

Pentagon embeds Claude, sole cleared AI, into classified tech amid culture wars

Random Labs releases Slate V1, swarm‑native coding agent with OS‑style memory

Alibaba sees key Qwen AI staff exit after Qwen3.5 open-source release

Google rolls out Gemini AI to all US users, free tier gets Personal Intelligence

Qualcomm's Elite chip targets AI wearables such as pendants, pins, and glasses

Google launches Gemini 3.1 Flash Lite, priced at one‑eighth of Gemini 3.1 Pro

OpenAI launches GPT-5.4 in standard, Pro, and Thinking versions

OpenClaw Superfan Meetup Highlights Optimism, Lobster and Varied Interests

Further Reading

Related Reading

Court rules AI model contains no copies, limits Getty watermark claim

LexisNexis CEO: AI now core legal infrastructure, like email or word processor

California enacts first U.S. regulations for AI companion chatbots

LexisNexis Chief Says AI Systems for JudgeGPT Have Been Significantly Advanced

Google to Pay USD 68 Million to Settle Lawsuit Over Assistant’s Unlawful Recording

Common Questions Answered

How does Mafin 2.5 improve upon traditional vector-based RAG for financial document analysis?

What are the key limitations of traditional vector-based RAG that PageIndex aims to solve?

What benchmark performance did Mafin 2.5 achieve using the PageIndex framework?

Most Popular

Dfinity's Caffeine AI Builds Apps Through Conversation

Mistral AI launches Forge to let firms build proprietary AI models

Pentagon embeds Claude, sole cleared AI, into classified tech amid culture wars

Random Labs releases Slate V1, swarm‑native coding agent with OS‑style memory

Alibaba sees key Qwen AI staff exit after Qwen3.5 open-source release

Google rolls out Gemini AI to all US users, free tier gets Personal Intelligence

Qualcomm's Elite chip targets AI wearables such as pendants, pins, and glasses

Google launches Gemini 3.1 Flash Lite, priced at one‑eighth of Gemini 3.1 Pro

OpenAI launches GPT-5.4 in standard, Pro, and Thinking versions

OpenClaw Superfan Meetup Highlights Optimism, Lobster and Varied Interests