C
Chris Hansen
I'm interested in finding some logic to help me
identify "near duplicates" in Outlook e-mail populations.
Basically, I'm interested in locating only the most
complete e-mail in a discussion thread chain. Now in many
cases that would be the latest e-mail, but not always,
since a user, in replying or forwarding, can alter the
contents of the foregoing chain. So a "near duplicate,"
by my definition, would contain the same or less content,
and nothing unique in that text. Anyone cracked this nut
before? Thanks much!
identify "near duplicates" in Outlook e-mail populations.
Basically, I'm interested in locating only the most
complete e-mail in a discussion thread chain. Now in many
cases that would be the latest e-mail, but not always,
since a user, in replying or forwarding, can alter the
contents of the foregoing chain. So a "near duplicate,"
by my definition, would contain the same or less content,
and nothing unique in that text. Anyone cracked this nut
before? Thanks much!