Document Index
What it is
The Document Index automatically splits, categorises, and summarises large bundles of evidence (up to 10,000 pages) into a searchable index. Each document within a bundle is assigned a clear name, a concise description, a rating of its relevance to the matter, and accurate page references. Users can view, triage, and export the index without manually reviewing every page.
Why it matters
Before legal analysis can begin, teams often spend hours manually renaming, converting, and sorting unsorted PDFs. This delays briefing, increases the risk of error, and clutters case preparation with irrelevant or duplicated content.
The Document Index removes that burden. It enables:
- Faster triage of incoming bundles
- Cleaner separation of relevant vs irrelevant material
- Trustworthy citations with accurate page ranges
- Early insight for strategy, client instructions, and brief-building
Where to find the Document Index
When you open a matter in Mary, go to the Sources tab.
You’ll see the Document Index table showing each sub-document extracted from your bundle.
How it works
- Upload: Upload any large bundle (up to 10,000 pages). You don’t need to split files beforehand unless a bundle exceeds the page limit or the 500 MB file size limit.
- Auto-Split & Classify: Mary automatically splits the bundle into sub-documents and classifies them by type (e.g. Medical → Discharge Summary, Court Docs → Affidavit).
- Summarise & Tag: Each sub-document is given a:
  - Name (based on type and contents)
  - Description (a brief summary of what the document is about)
  - Page range (the exact location of the sub-document within the source bundle)
  - Relevance rating (how relevant the document is to the matter; a document’s relevance is informed by the relevance of the events extracted from it)
  - Source file reference (deep-link enabled)
- Review & Filter: Use filters (by date, relevance, or source file) to rapidly find what matters and cull duplicates or irrelevant material.
- Export: Export a .docx version of the index for inclusion in briefs or to serve on other parties. Page ranges help teams extract content manually if needed.
- Incremental Updates Supported: New bundles can be added at any time. Mary integrates them into the existing index without reprocessing everything or duplicating prior entries.
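For teams that want to reason about the index outside Mary (for example, after exporting it), the steps above can be pictured as a small data model. The following is a minimal sketch only: the `IndexEntry` fields, the 0 to 3 relevance scale, the file names, and the rule that an entry is identified by its source file and page range are all assumptions made for illustration, not Mary’s actual schema or API. Note that the merge below only avoids re-adding entries that are already indexed; it is not content-level duplicate detection.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IndexEntry:
    """One row of the index. Field names are hypothetical, not Mary's schema."""
    name: str
    description: str
    source_file: str
    first_page: int  # 1-indexed, inclusive
    last_page: int   # 1-indexed, inclusive
    relevance: int   # assumed scale: 0 (irrelevant) to 3 (highly relevant)

    @property
    def key(self) -> tuple:
        # Assumed identity rule: same source file and same page range.
        return (self.source_file, self.first_page, self.last_page)


def filter_entries(entries, min_relevance=0, source_file=None):
    """Keep entries at or above a relevance threshold, optionally from one source file."""
    return [
        e for e in entries
        if e.relevance >= min_relevance
        and (source_file is None or e.source_file == source_file)
    ]


def merge_incremental(existing, new_entries):
    """Append only entries whose key is not already present in the index."""
    seen = {e.key for e in existing}
    merged = list(existing)
    for entry in new_entries:
        if entry.key not in seen:
            merged.append(entry)
            seen.add(entry.key)
    return merged


# Made-up example data.
index = [
    IndexEntry("Discharge Summary", "Hospital discharge after surgery", "bundle_a.pdf", 1, 4, 3),
    IndexEntry("Affidavit", "Affidavit of the claimant", "bundle_a.pdf", 5, 12, 2),
]
later_upload = [
    IndexEntry("Affidavit", "Affidavit of the claimant", "bundle_a.pdf", 5, 12, 2),  # already indexed
    IndexEntry("Payslip", "Employer payslip", "bundle_b.pdf", 1, 1, 0),
]

index = merge_incremental(index, later_upload)
relevant = filter_entries(index, min_relevance=2)
print([e.name for e in relevant])  # ['Discharge Summary', 'Affidavit']
```

In this sketch the later upload adds only the payslip, and the relevance filter then hides it, which mirrors the triage flow of adding new material and reviewing only what matters.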
Use cases
- Initial triage of new matter: Identify which documents are relevant before starting deeper review.
- Preparing a brief: Quickly export or reference only the strategic documents needed for counsel.
- New client or discovery uploads: When clients send new folders of disclosure material, quickly surface and review only the new documents added.
- Occupational disease files: Filter for keywords or relevance to locate documents referencing chemicals, diagnoses, or incidents.
- Duplication handling (coming soon): Consolidate duplicated documents.
Tips and notes
- Descriptions are designed to surface key legal or medical insight (e.g., causation, diagnosis, author) at a glance.
- Treat the Document Index as a staging area before the Chronology—it helps ensure only the best material enters downstream workflows.
- Each document links back to its original source in the viewer—no more scrolling through massive PDFs to verify content.
- Until export-to-folder is available, the .docx index with page ranges still saves significant time when locating and extracting documents manually.
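When extracting a sub-document by hand, the main pitfall is the off-by-one between the page numbers people read and the zero-based positions most PDF tooling uses. The helper below is a minimal sketch, assuming page ranges are written as 1-indexed, inclusive, hyphen-separated strings such as "12-18"; that format and the `page_indices` name are assumptions for illustration, not a description of the exported index.

```python
def page_indices(page_range: str) -> range:
    """Convert a 1-indexed, inclusive range such as "12-18" into 0-indexed positions."""
    start_text, _, end_text = page_range.partition("-")
    start = int(start_text)
    end = int(end_text) if end_text else start  # "7" means the single page 7
    if start < 1 or end < start:
        raise ValueError(f"invalid page range: {page_range!r}")
    # range() excludes its stop value, so `end` already covers the last page.
    return range(start - 1, end)


print(list(page_indices("12-18")))  # [11, 12, 13, 14, 15, 16, 17]
print(list(page_indices("7")))      # [6]
```

The resulting positions can be passed to whichever PDF library your team already uses to copy those pages into a new file.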
Known limitations
- The system does not currently detect or merge duplicate sub-documents. You may need to manually exclude repeated material.
- Manual editing of sub-document names and page ranges is not supported in this initial version.
- Document filtering is currently applied at the source file level, not the sub-document level.
Please note: if any of these limitations affect you, reach out to Luke, our Head of Product (luke@marytechnology.com).
Coming soon
We’re actively improving the Document Index with the following upcoming features:
- Grouped View: Sub-documents will be visually grouped under their original source files for easier navigation.
- Duplicate detection: Automatically flag repeated documents across bundles (e.g. same discharge summary sent twice).
- Sub-document filtering: Filter or sort by document type, relevance, or author across the full index.
- Editable names and page ranges: Rename or adjust sub-document boundaries to match your workflow or firm conventions.
- Custom document type assignment: Configure document type categories to match your firm’s DMS or personal classification needs.
- Export-to-folder: Save split documents back into categorised folders for DMS upload or service.
Feedback welcome
This is Version 1 of the Document Index. If something doesn’t look right, or you want a feature added (e.g. group by sender), contact support or use the in-app feedback option. We’re improving fast based on your input.
If you have any questions, contact our team directly or email support@marytechnology.com.