Home | Accounts | Credentials | Peers | Projects | Upload | De-duplicate | Cluster | View | Browse | Search | Buckets | Datasets | Assign | Notifications | Toolbox | Code | Bookmarks | Validate | Report | FAQ | Contact

Duplicate detection and clustering

It sure does. If you currently have an FDMS bulk download, or a large collection emails in Lotus Notes or Microsoft Outlook and you suspect it has mass email campaign duplicates, you can now run the de-duplication and clustering algorithms inside PCAT. Here is an example:



Users of PCAT can relax the threshold for identifying duplicates and sample “clusters” of near-duplicate comments generated by form letter and talking point campaigns. This step makes it easier to both document the dimensions of the central “talking points” while also focusing attention on unique or otherwise unexpected contributions.



Most Frequently Asked Questions

 
C:/PCAT/new-pcat-help/data/data/pages/does_pcat_identify_duplicates.txt · Last modified: 2010/06/10 14:01 by stu
 
Except where otherwise noted, content on this wiki is licensed under the following license:CC Attribution-Noncommercial-Share Alike 3.0 Unported
Recent changes RSS feed Donate Powered by PHP Valid XHTML 1.0 Valid CSS Driven by DokuWiki