Update (2015-09-29): Various improvements!

  • The comparison algorithm now uses 5-grams rather than 3-grams, which should reduce the number of false positives when duplication consists of a lot of small phrases.
  • Exclusion detection is a little smarter; some mirrors are detected by HTML tags.
  • Excluded URLs are visible in the list of checked sources.
  • Long sequences of text inside template parameters are treated as part of the article.
  • Long URLs and page content no longer break the display.
  • There is only one background per day rather than one for each worker.

Please let me know if any of these changes are causing problems or not working as expected.
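
To illustrate why moving from 3-grams to 5-grams should cut down on false positives, here is a minimal sketch of word n-gram overlap. It is not the tool's actual code: the tokenization rule and the function names `ngrams` and `shared_ngrams` are assumptions made for this example only.

```python
import re


def ngrams(text, n):
    """Return the set of word n-grams (as tuples) found in text.

    Assumption: words are lowercase runs of word characters; the real
    tool may tokenize differently.
    """
    words = re.findall(r"\w+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def shared_ngrams(article, source, n=5):
    """Return the n-grams that appear in both texts."""
    return ngrams(article, n) & ngrams(source, n)


article = "the quick brown fox jumps over the lazy dog near the river bank"
source = "a quick brown fox sat while the lazy dog slept near the river bank"

# Short common phrases match as 3-grams but not as 5-grams.
print(len(shared_ngrams(article, source, n=3)))
print(len(shared_ngrams(article, source, n=5)))
```

The two texts share only short phrases of three or four words, so they overlap at n=3 but not at n=5. Genuinely copied passages tend to run longer and would still be caught with the larger window.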

This tool attempts to detect copyright violations in articles. In search mode, it will check for similar content elsewhere on the web using Yahoo! BOSS and/or external links present in the text of the page, depending on which options are selected. In comparison mode, the tool will skip the searching step and display a report comparing the article to the given webpage, like the Duplication Detector.

Running a full check can take up to 45 seconds if other websites are slow. Please be patient. If you get a timeout, wait a moment and refresh the page.

Specific websites can be skipped (for example, if their content is in the public domain) by adding them to the excluded URL list.
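
As a rough sketch of how an exclusion check of this kind might work, the following compares a candidate source URL against a list of excluded entries. The entry format (a host, optionally followed by a path prefix) and the function name `is_excluded` are hypothetical; the format of the real excluded URL list is not described here.

```python
from urllib.parse import urlsplit


def is_excluded(url, excluded):
    """Return True if url falls under any entry in excluded.

    Assumption: each entry is "host" or "host/path-prefix". An entry
    matches the host itself and any of its subdomains.
    """
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    path = parts.path or "/"
    for entry in excluded:
        ex_host, _, ex_path = entry.partition("/")
        ex_host = ex_host.lower()
        host_match = host == ex_host or host.endswith("." + ex_host)
        if host_match and path.startswith("/" + ex_path):
            return True
    return False


# Hypothetical exclusion list, for illustration only.
excluded = ["example.org", "mirror.example.com/wiki"]

print(is_excluded("https://www.example.org/page", excluded))
print(is_excluded("http://mirror.example.com/blog", excluded))
```

Matching on the parsed host rather than on the raw URL string avoids treating an unrelated domain such as `example.organic.com` as if it were `example.org`.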
