Document Index Splitting (PDF Splitting)
Last updated: October 9, 2025
What it is
Document Index Splitting automatically detects the individual documents contained inside large “bundles” (e.g., court books, subpoena returns, disclosure bundles, email exports) and turns them into a structured Document Index with metadata. From there, you can export either a Word index or the split files themselves, named and ready to use in your DMS/PMS or on your computer.
Why it matters
Eliminate manual splitting/renaming of giant bundles
Find what you need faster with dates, descriptions, categories, relevance, and page ranges
Export cleanly into existing systems and workflows
What it does (at a glance)
When you upload a bundle (PDF, .eml, .msg, etc.), Mary:
Identifies individual documents contained within the bundle.
Builds a Document Index showing, for each detected document:
Date (where available)
Name / Title
Brief description
Category
Page range (from the original bundle)
Relevance rating
Links to view the original pages
Lets you select any or all items in the index.
Exports either:
A File Index (Word) matching what you see in Mary, or
The split files themselves (named and individually separated).
Sends exports to your Document/Practice Management System (if integrated) or downloads them to your computer.
Example: In a public test from the Novak Djokovic matter, four uploaded PDFs produced 76 individual documents (each with metadata and page ranges) ready to export.
Supported inputs
PDF bundles (single or multi-document)
Email archives: .eml and .msg (Outlook)
You can upload thousands of files; Mary will split, organise, categorise, and surface the most relevant items to your case.
How it works (workflow)
Upload your bundle
Drag-and-drop or upload via your matter. PDFs, .eml, and .msg supported.
Open the Document Index
Mary automatically analyses the bundle and lists identified documents with metadata. Use filters (e.g., date or relevance) to narrow your view.
Verify details
Click any row to preview the source pages. Confirm titles, dates, categories, and page ranges.
Select items
Choose individual documents or Select All.
Export
File Index (Word): Creates a simple, shareable index (names, categories, page ranges, etc.).
Split Files: Generates individually named files per index row.
Choose destination
Integrated DMS/PMS: Export directly to supported partners.
Local download: Save split files to your computer.
Naming & ordering
Current behaviour:
Default naming uses the document name and date (when available).
You can select any number of documents to export and adjust the export order.
Coming soon:
Custom naming conventions you can define yourself.
Working with existing matters
If a matter already shows an Index View, you can open it immediately, select the documents you need, and export without reprocessing.
Export options in detail
1) File Index (Word)
Creates a concise table that mirrors the Mary index (name, category, page range, etc.).
Ideal for briefs, instructions to counsel, or sharing with colleagues.
2) Split Files
Produces fully separated, named documents for downstream workflows.
Deliver to your DMS/PMS (if integrated) or download locally.
Tips & best practices
Filter by relevance/date to prioritise critical documents.
Spot-check page ranges before exporting to ensure splits align with your requirements.
Export the Word index early for quick team alignment; export split files once you’re ready to proceed.
FAQs
Is this included in my plan?
Yes. Document Index Splitting is included in current Mary subscriptions.
Can I change the naming convention?
Not yet, but custom naming is coming soon. Today, names are based on document name and date where available.
Can I export only a subset?
Yes. Select any set of items and export just those.
Where can I export to?
Either directly to your DMS/PMS (with supported integrations) or as a local download.
Does Mary keep the link to original pages?
Yes. the index shows the original page ranges and lets you open the source pages.
Troubleshooting
A document was misidentified → Open the row, review the preview and page range. If something looks off, note the details and send feedback via in‑app chat so we are aware of this.
I don’t see export options → Ensure you’ve selected one or more index items. For DMS/PMS export, confirm the integration is connected for your org.
Large bundles take time → You can navigate away; processing continues in the background and you’ll see the index when ready.
Give feedback
We’re actively evolving this feature (custom naming, additional formats, richer metadata). Share ideas via in‑app chat or book a quick call with the team.