Deduplication

Last updated: 2026-03-23

The Deduplicate Files tool identifies duplicate files across your entire case and groups them together. Review each group, choose which copy to keep, and remove the rest. Access this tool from the Case Menu in the top navigation bar.

Deduplication dialog

Review duplicate groups and choose which files to keep

Duplicate Files Detected

3 groups of duplicate files found (7 files total). Choose how to handle each group.

Group 1 - Contract_MSA_2024.pdf
  • Contract_MSA_2024.pdf (2.3 MB, uploaded Jan 15, 2025)Keep
  • Contract_MSA_2024 (1).pdf (2.3 MB, uploaded Feb 03, 2025)
Group 2 - Deposition_Smith.pdf
  • Deposition_Smith.pdf (5.1 MB, uploaded Dec 20, 2024)Keep
  • Deposition_Smith_copy.pdf (5.1 MB, uploaded Jan 05, 2025)
  • Smith_Deposition_Final.pdf (5.1 MB, uploaded Jan 10, 2025)
Group 3 - Financial_Records_Q3.xlsx
  • Financial_Records_Q3.xlsx (1.8 MB, uploaded Nov 01, 2024)Keep
  • Financial_Records_Q3 (2).xlsx (1.8 MB, uploaded Nov 15, 2024)
CancelRemove 4 Duplicates

How to access deduplication

Deduplication is a Case Menu tool, not a settings panel. Open the Case Menu from the top navigation bar and select Deduplicate Files. Hintyr scans all files in the case and identifies groups of duplicates.

How file deduplication works

Hintyr compares files based on content to identify exact duplicates, even when file names differ. Files with identical content are grouped together. Each group shows:

  • File name - The name of each duplicate file.
  • File size - The size of the file for verification.
  • Upload date - When the file was uploaded to help identify the original.
  • Keep badge - The file recommended to keep (typically the earliest upload).

Choosing which duplicate files to keep

By default, Hintyr selects the earliest uploaded copy in each group as the file to keep. You can change this selection by clicking on a different file within the group. Files not marked as "Keep" will be removed when you confirm.

Removing duplicate files

After reviewing the groups, click the remove button to delete unselected duplicates. The dialog shows how many files will be removed. This action is permanent. Kept files retain all their metadata, tags, custodian assignments, redactions, and Bates numbers.

When to run deduplication

Deduplication is most useful in these scenarios:

  • After importing a large batch of files where the same document may have been collected from multiple sources.
  • When multiple team members upload files independently and some overlap.
  • Before production to ensure each document appears only once.
  • When consolidating files from multiple custodians who shared the same documents.

Frequently asked questions

Can I undo deduplication after removing files?
No. Removing duplicates is permanent. Review each group carefully before confirming. If you accidentally remove a file, you will need to re-upload it.
Does deduplication compare file content or just names?
Deduplication compares file content, so files with different names but identical content will be identified as duplicates.
What happens to tags and redactions on removed files?
Tags, redactions, Bates numbers, and custodian assignments on removed files are deleted along with the files. The kept file retains all its metadata.
Can I run deduplication multiple times?
Yes. You can run deduplication at any time. If no duplicates are found, the tool reports that no duplicate groups were detected.

Related articles