Skip to main content
A diagram illustrating a tree search framework outperforming vector search, achieving 98.7% success. [alttext.ai](https://alt

Editorial illustration for Tree search framework achieves 98.7% success on docs where vector search fails

Tree Search Beats Vector Retrieval with 98.7% Success

Tree search framework achieves 98.7% success on docs where vector search fails

2 min read

Vector‑based retrieval has become the default for most enterprise search tools, but its strength is also its weakness: it leans on dense embeddings that capture a document’s overall theme, not the fine‑grained steps a user may need to follow. When a question requires chaining several pieces of evidence—think “Which regulation changed after the 2022 privacy amendment and how does it affect data‑retention policies?”—the embeddings often collapse the nuance into a single point, and the system returns irrelevant hits. Researchers have been probing alternatives, and a new tree‑search framework now reports a 98.7 % success rate on the subset of queries where traditional vector search falls short.

The numbers suggest a structural shift: instead of relying on a holistic similarity score, the method breaks the problem into bite‑size, context‑light prompts that can be recombined. That change matters because it directly tackles the “multi‑hop” reasoning gap that has long hampered AI‑assisted research and compliance work.

---

The system matches documents against a short, decontextualized query rather than the full history of the problem the user is trying to solve. Solving the multi‑hop reasoning problem…

The system matches documents against a short, decontextualized query rather than the full history of the problem the user is trying to solve. Solving the multi-hop reasoning problem The real-world impact of this structural approach is most visible in "multi-hop" queries that require the AI to follow a trail of breadcrumbs across different parts of a document. In a recent benchmark test known as FinanceBench, a system built on PageIndex called "Mafin 2.5" achieved a state-of-the-art accuracy score of 98.7%. The performance gap between this approach and vector-based systems becomes clear when analyzing how they handle internal references.

Related Topics: #Vector Search #Tree Search Framework #Multi-hop Reasoning #Enterprise Search #AI Retrieval #Document Embeddings #Semantic Search #FinanceBench #Context Retrieval

PageIndex delivers a striking 98.7 % success rate on documents that stump traditional vector search. The gain is notable. Yet the numbers come from a narrow set of failure cases, leaving open whether the same gain appears across broader corpora.

By discarding the chunk‑and‑embed pipeline, the framework re‑imagines retrieval as a navigational tree walk, aligning short, decontextualized queries with document structure rather than semantic similarity scores. This shift appears to unlock multi‑hop reasoning, where an answer depends on traversing several linked passages. The open‑source nature of the project invites community scrutiny, but the article does not detail runtime costs or how the approach scales to collections beyond the tested subset.

Consequently, while the reported accuracy is impressive, it remains unclear whether the method can replace vector databases in production environments or if it will coexist as a complementary tool for particularly long texts. Future benchmarks that compare end‑to‑end latency and resource usage will be needed to assess the practical trade‑offs of this tree‑search paradigm.

Further Reading

Common Questions Answered

How does Mafin 2.5 improve upon traditional vector-based RAG for financial document analysis?

Mafin 2.5 uses the PageIndex framework to preserve document structure and reasoning, moving beyond simple vector similarity. Instead of chunking documents arbitrarily, it maintains the hierarchical relationships between sections, tables, and footnotes, allowing for more precise and contextually aware document retrieval.

What are the key limitations of traditional vector-based RAG that PageIndex aims to solve?

Traditional vector RAG struggles with loss of document structure, semantic ambiguity, and opaque retrieval processes. The PageIndex framework addresses these issues by treating documents as structured trees, enabling dynamic navigation and reasoning that mimics how a human expert would analyze complex financial documents.

What benchmark performance did Mafin 2.5 achieve using the PageIndex framework?

Mafin 2.5 achieved an impressive 98.7% accuracy on the FinanceBench benchmark, significantly outperforming traditional vector-based RAG systems. This breakthrough demonstrates the potential of reasoning-driven search approaches in extracting precise insights from complex financial documents like SEC filings and earnings reports.