Email dedup strategy

Last updated: 2026-05-16

Email dedup has three knobs: policy, scope, and BCC handling. Defaults work for most matters. Last reply plus unique catches forked threads with unique attachments. Global collapses cross-custodian copies. Default (ignore BCC) groups sender and BCC copies as one. Switch defaults when the case calls for it. Disclose your choices to opposing counsel in writing.

Deduplicate dialog: Emails tab

The three knobs that shape the dedup run.

Deduplicate
Policy
Scope
BCC handling

Options and fields

Policy

Policy decides which copies count as duplicates inside a thread. Two choices: Last reply only and Last reply plus unique (default).

Worked example. A five-email thread. Robin sends a contract draft. Jordan replies confirming section 4. Casey replies with a redline attached (unique attachment). Robin acknowledges. Robin sends v4 with everything quoted. Which copies stay?

Last reply only
Only the latest reply in the thread is kept. Everything earlier is hidden, including the mid-thread message with the unique attachment.
1
Robin VasquezMar 12, 9:14 AM
Attaching the draft for review. Let me know your thoughts on section 4 by Friday.Attachment: Draft_3.docx
2
Jordan HsuMar 13, 10:02 AM
Section 4 looks fine to me. Casey, anything from your end?
3
Casey LinMar 14, 11:33 AM
Two notes on indemnification. See the redline attached.Attachment: Draft_3_CL_redline.docx
4
Robin VasquezMar 15, 9:01 AM
Got it, working on a v4 today. Will send by EOD.
5
Robin VasquezMar 15, 4:47 PM
Kept
v4 attached. I incorporated Casey's redline. Take a look.Attachment: Draft_4.docx
Kept: email 5. 1 of 5 visible.
Last reply plus unique
The latest reply is kept, and so is any earlier message with a unique attachment or unique content not subsumed by later replies.
1
Robin VasquezMar 12, 9:14 AM
Attaching the draft for review. Let me know your thoughts on section 4 by Friday.Attachment: Draft_3.docx
2
Jordan HsuMar 13, 10:02 AM
Section 4 looks fine to me. Casey, anything from your end?
3
Casey LinMar 14, 11:33 AM
Kept
Two notes on indemnification. See the redline attached.Attachment: Draft_3_CL_redline.docx
4
Robin VasquezMar 15, 9:01 AM
Got it, working on a v4 today. Will send by EOD.
5
Robin VasquezMar 15, 4:47 PM
Kept
v4 attached. I incorporated Casey's redline. Take a look.Attachment: Draft_4.docx
Kept: email 3 and 5. 2 of 5 visible.

Last reply onlykeeps just email 5. Casey's redline attachment vanishes from review because email 5 quotes the body text but doesn't carry the attached file. Fast, aggressive, lossy. Last reply plus uniquekeeps email 3 and email 5. Casey's redline survives because it's a unique artifact. Use this default unless you've negotiated a last-only protocol with opposing counsel. The Sedona Conference and EDRM Processing Standards align with the last-plus-unique pattern for inclusive-only review.

Scope

Scope decides whether dedup runs across the whole case or per custodian. The two choices map to two case shapes.

AspectGlobalPer custodian
How dedup worksRuns across the whole case at once. A thread is collapsed once, no matter how many custodians' mailboxes it landed in.Runs independently per custodian. Each custodian's copy of a thread is dedup'd in isolation.
Reduction ratioHigher. Cross-custodian copies collapse.Lower. Each custodian retains their own thread copy.
Custodian provenanceCaptured on the inclusive copy as a list of all custodians whose copy was subsumed.Preserved per custodian. Each custodian's thread sequence stays distinct.
Use whenMulti-party litigation where the conversation matters more than who's holding which copy.Internal investigations where each custodian's exact sequence matters (who saw what, when).

Use Global for multi-party litigation. The production set is what matters, not who happened to hold which copy. Use Per custodianfor internal investigations where each custodian's exact sequence matters. If you're building a timeline of who saw what and when, you don't want one custodian's inclusive copy absorbing everyone else's.

BCC handling

BCC handling decides whether differences in the BCC list split a dedup group. Two choices.

AspectDefault (ignore BCC)Strict (include BCC)
What counts as a duplicateSender's sent copy and the BCC recipient's received copy collapse together. BCC is invisible to dedup.Sender's copy and BCC recipient's copy stay distinct. BCC differences split the dedup group.
Reduction ratioHigher. Fewer dedup groups.Lower. More dedup groups retained.
Trade-offYou see one copy of an email that may have had multiple BCC routes. Saves review time.You see each BCC-distinct copy. Useful when establishing who actually received the message matters for the case.
Use whenMost matters. BCC rarely changes the substantive content that matters for review.Investigations where the BCC list is itself probative (for example, secretly informing a third party).

Default for most matters. Switch to Strict when the BCC list is itself evidence (someone secretly informing a third party, for instance). The trade-off is a lower reduction ratio and more emails to review.

Defensibility

Email thread dedup is widely used but should be disclosed to opposing counsel and noted in the ESI protocol. Courts have upheld threading when parties agree to it upfront in writing. Courts have ordered full production of non-inclusive emails when parties did not negotiate the practice first.

The Sedona Conference Cooperation Proclamation and the EDRM Processing Standards both recommend documenting dedup choices in the ESI protocol before collection ends. In re Tecfidera and In re Actos are the two cases practitioners cite most often: both turned on whether the producing party disclosed the dedup practice in advance.

Always disclose your dedup choices in writing before production. Pin the policy, scope, and BCC mode in the ESI protocol and keep the proportionality argument ready: thread dedup reduces review burden without losing evidence. The hidden copies stay in the case record and remain available for production-side reconciliation if challenged.

Edge cases

  • A forwarded thread with edited quoted text counts as unique content under Last reply plus unique. Hintyr keeps that copy.
  • Mid-thread attachments not re-attached in later replies are unique. The policy default keeps the message that carries them.
  • A thread split into a forward chain and a reply chain is two inclusive copies. Both survive Last reply plus unique.
  • Per custodian scope doesn't collapse cross-custodian duplicates. If two custodians both received the same external email, you'll see both copies. Use Global if that's what you want.

Frequently asked questions

Should I use 'last reply only' or 'last reply plus unique'?
Use 'last reply plus unique' (the default) unless you've negotiated a stricter protocol with opposing counsel. 'Last reply plus unique' keeps any mid-thread message with a unique attachment or content that isn't quoted in a later reply, so you don't lose evidence to the dedup pass. 'Last reply only' is more aggressive: it keeps just the final reply and hides everything earlier, including mid-thread messages that carry unique attachments. Use it only when the volume reduction is worth the loss and the protocol allows it.
Will opposing counsel know I deduplicated emails?
They should, because you should tell them in the ESI protocol. Email thread dedup is widely used and generally defensible, but courts have ordered full production of non-inclusive emails when the producing party didn't negotiate the practice in writing first. Sedona Conference guidance and the EDRM Processing Standards both recommend documenting dedup choices upfront. Disclose the policy, scope, and BCC mode in writing before production. In re Tecfidera and In re Actos are the cases practitioners cite when this issue comes up.
When should I switch from Global scope to Per custodian?
Per custodian is right when each custodian's exact email sequence matters: internal investigations, key-player timelines, or any matter where who-saw-what-when is itself probative. Global is right for multi-party litigation where the conversation is what matters. Default is Global because most matters fit that shape.
When does BCC handling actually make a difference?
When the BCC list is evidence. Most matters, BCC is invisible to the substantive review and Default (ignore BCC) is the right choice. If you're investigating whether someone secretly informed a third party, switch to Strict so each BCC-distinct copy stays in review.