
Public Comment Analysis Toolkit Help Wiki


Does PCAT identify duplicates?


Duplicate detection and clustering



It sure does. If you have an FDMS bulk download, or a large collection of emails in Lotus Notes or Microsoft Outlook, and you suspect it contains duplicates from a mass email campaign, you can run the de-duplication and clustering algorithms inside PCAT. Here is an example:

Image

Users of PCAT can relax the threshold for identifying duplicates and sample “clusters” of near-duplicate comments generated by form letter and talking point campaigns. This step makes it easier to document the dimensions of the central “talking points” while focusing attention on unique or otherwise unexpected contributions.
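To illustrate what relaxing a duplicate threshold means in general, here is a minimal Python sketch. It is not PCAT's actual algorithm, which this page does not describe. It assumes a common approach: each comment is broken into overlapping three-word "shingles", two comments are compared by Jaccard similarity, and a comment joins the first cluster whose first member is at least `threshold` similar. The names `shingles`, `jaccard`, `cluster_comments`, and the sample comments are all hypothetical.

```python
import re


def shingles(text, k=3):
    """Return the set of k-word shingles in text (lowercased, punctuation dropped)."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < k:
        # Very short comments become a single shingle (or none if empty).
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets: |intersection| / |union|."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster_comments(comments, threshold=0.8):
    """Greedy single-pass clustering (an assumed, simplified strategy).

    Each comment joins the first cluster whose representative (its first
    member) has Jaccard similarity >= threshold; otherwise it starts a
    new cluster. Returns lists of comment indices.
    """
    clusters = []  # list of (representative_shingles, member_indices)
    for idx, text in enumerate(comments):
        sh = shingles(text)
        for rep, members in clusters:
            if jaccard(sh, rep) >= threshold:
                members.append(idx)
                break
        else:
            clusters.append((sh, [idx]))
    return [members for _, members in clusters]


# Hypothetical sample: a form letter, an exact copy, a copy with a
# personal note appended, and an unrelated comment.
sample = [
    "Please protect the wolves in Yellowstone National Park now.",
    "Please protect the wolves in Yellowstone National Park now.",
    "Please protect the wolves in Yellowstone National Park now. "
    "I visit every summer with my kids.",
    "I run a ranch nearby and have concerns about livestock losses.",
]

print(cluster_comments(sample, threshold=1.0))  # [[0, 1], [2], [3]]
print(cluster_comments(sample, threshold=0.5))  # [[0, 1, 2], [3]]
```

With a strict threshold of 1.0, only the exact copies group together. Relaxing it to 0.5 pulls the personalized form letter into the same cluster, while the unrelated comment stays on its own. That standalone comment is the kind of unique contribution the clustering step is meant to surface.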



© 2009 - 2010 Qualitative Data Analysis Program (QDAP), in the University Center for Social and Urban Research, at the University of Pittsburgh, and QDAP-UMass, in the College of Social and Behavioral Sciences, at the University of Massachusetts Amherst. As of 2010, PCAT and this PCAT Help Wiki are maintained and improved by personnel from Texifter, LLC, which is a software start-up located in North Amherst & Springfield, MA and online at http://texifter.com/.

Content on this website was made possible with the following grants from the National Science Foundation: III-0705566 "Collaborative Research III-COR: From a Pile of Documents to a Collection of Information: A Framework for Multi-Dimensional Text Analysis" and IIS-0429293 "Collaborative Research: Language Processing Technology for Electronic Rulemaking." We are also grateful for financial support from the U.S. Environmental Protection Agency and the U.S. Fish & Wildlife Service. **Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect those of the National Science Foundation.**
