Document Index Splitting (PDF Splitting)

Last updated: October 9, 2025


What it is

Document Index Splitting automatically detects the individual documents contained inside large “bundles” (e.g., court books, subpoena returns, disclosure bundles, email exports) and turns them into a structured Document Index with metadata. From there, you can export either a Word index or the split files themselves, named and ready to use in your DMS/PMS or on your computer.


Why it matters

  • Eliminate manual splitting/renaming of giant bundles

  • Find what you need faster with dates, descriptions, categories, relevance, and page ranges

  • Export cleanly into existing systems and workflows


What it does (at a glance)

When you upload a bundle (PDF, .eml, .msg, etc.), Mary:

  1. Identifies individual documents contained within the bundle.

  2. Builds a Document Index showing, for each detected document:

    • Date (where available)

    • Name / Title

    • Brief description

    • Category

    • Page range (from the original bundle)

    • Relevance rating

    • Links to view the original pages

  3. Lets you select any or all items in the index.

  4. Exports either:

    • A File Index (Word) matching what you see in Mary, or

    • The split files themselves (named and individually separated).

  5. Sends exports to your Document/Practice Management System (if integrated) or downloads them to your computer.

Example: In a public test from the Novak Djokovic matter, four uploaded PDFs produced 76 individual documents (each with metadata and page ranges) ready to export.


Supported inputs

  • PDF bundles (single or multi-document)

  • Email archives: .eml and .msg (Outlook)

You can upload thousands of files; Mary will split, organise, categorise, and surface the most relevant items to your case.


How it works (workflow)

  1. Upload your bundle

    • Drag-and-drop or upload via your matter. PDFs, .eml, and .msg supported.

  2. Open the Document Index

    • Mary automatically analyses the bundle and lists identified documents with metadata. Use filters (e.g., date or relevance) to narrow your view.

  3. Verify details

    • Click any row to preview the source pages. Confirm titles, dates, categories, and page ranges.

  4. Select items

    • Choose individual documents or Select All.

  5. Export

    • File Index (Word): Creates a simple, shareable index (names, categories, page ranges, etc.).

    • Split Files: Generates individually named files per index row.

  6. Choose destination

    • Integrated DMS/PMS: Export directly to supported partners.

    • Local download: Save split files to your computer.


Naming & ordering

  • Current behaviour:

    • Default naming uses the document name and date (when available).

    • You can select any number of documents to export and adjust the export order.

  • Coming soon:

    • Custom naming conventions you can define yourself.


Working with existing matters

If a matter already shows an Index View, you can open it immediately, select the documents you need, and export without reprocessing.


Export options in detail

1) File Index (Word)

  • Creates a concise table that mirrors the Mary index (name, category, page range, etc.).

  • Ideal for briefs, instructions to counsel, or sharing with colleagues.

2) Split Files

  • Produces fully separated, named documents for downstream workflows.

  • Deliver to your DMS/PMS (if integrated) or download locally.


Tips & best practices

  • Filter by relevance/date to prioritise critical documents.

  • Spot-check page ranges before exporting to ensure splits align with your requirements.

  • Export the Word index early for quick team alignment; export split files once you’re ready to proceed.


FAQs

Is this included in my plan?
Yes. Document Index Splitting is included in current Mary subscriptions.

Can I change the naming convention?
Not yet, but custom naming is coming soon. Today, names are based on document name and date where available.

Can I export only a subset?
Yes. Select any set of items and export just those.

Where can I export to?
Either directly to your DMS/PMS (with supported integrations) or as a local download.

Does Mary keep the link to original pages?
Yes. the index shows the original page ranges and lets you open the source pages.


Troubleshooting

  • A document was misidentified → Open the row, review the preview and page range. If something looks off, note the details and send feedback via in‑app chat so we are aware of this.

  • I don’t see export options → Ensure you’ve selected one or more index items. For DMS/PMS export, confirm the integration is connected for your org.

  • Large bundles take time → You can navigate away; processing continues in the background and you’ll see the index when ready.


Give feedback

We’re actively evolving this feature (custom naming, additional formats, richer metadata). Share ideas via in‑app chat or book a quick call with the team.