• The ARTFL Project
  • PhiloLogic4
  • Subscription Information
  • University of Chicago
  • ATILF - CNRS

ARTFL Encyclopédie - Robert Morrissey, General Editor; Glenn Roe, Assoc. Editor

Sequence Alignment

Sequence Alignment/PAIR: Pairwise Alignment for Intertextual Relations

In 2009, ARTFL released PAIR (Pairwise Alignment for Intertextual Relations) as open source software, with an alpha version of PhiloLine available for download at Google Code. PAIR is designed as a powerful search tool to help scholars tackle and better understand the widespread problem of literary text reuse.

While PAIR was developed in response to the fairly specific phenomenon of similar passages across literary works, the sequence analysis techniques employed in PAIR were developed in widely disparate fields, such as bioinformatics and computer science, with applications ranging from genome sequencing to plagiarism detection. PAIR generates a set of overlapping word sequence shingles for every text in a corpus, then stores and indexes that information to be analyzed against shingles from other texts.
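The shingling step described above can be sketched roughly as follows. This is a minimal illustration, not PAIR's or PhiloLine's actual implementation: the function names, the shingle size of three words, the simple regex tokenizer, and the two sample sentences are all assumptions made for the example.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens (a simplifying assumption)."""
    return re.findall(r"\w+", text.lower())


def shingles(tokens, n=3):
    """Return every overlapping run of n consecutive tokens with its start position."""
    return [(tuple(tokens[i:i + n]), i) for i in range(len(tokens) - n + 1)]


def build_index(corpus, n=3):
    """Map each shingle to the (doc_id, position) pairs where it occurs."""
    index = defaultdict(list)
    for doc_id, text in corpus.items():
        for shingle, pos in shingles(tokenize(text), n):
            index[shingle].append((doc_id, pos))
    return index


def shared_shingles(index, doc_a, doc_b):
    """Return, in sorted order, the shingles that occur in both documents."""
    shared = []
    for shingle, hits in index.items():
        docs = {doc_id for doc_id, _ in hits}
        if doc_a in docs and doc_b in docs:
            shared.append(shingle)
    return sorted(shared)


# Two made-up sentences standing in for full texts (hypothetical data).
corpus = {
    "encyclopedie": "la gloire est la réputation jointe à l'estime",
    "frantext": "on dit que la gloire est la réputation des héros",
}
index = build_index(corpus)
common = shared_shingles(index, "encyclopedie", "frantext")
```

In this toy corpus, the two texts share the run "la gloire est la réputation", which surfaces as several consecutive shared shingles. A real system would presumably merge such adjacent hits into a single aligned passage and report its length in words.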

Shingles shared across texts can indicate many different types of textual borrowing, from direct citations to more ambiguous and unattributed uses of a passage. Using the search form below, the user can quickly identify similar passages shared between the Encyclopédie and the 3,500+ works included in the ARTFL-FRANTEXT database. (Note: ARTFL-FRANTEXT is a subscription database, and as such full-text results cannot be displayed. For more information on ARTFL subscription services, please visit our Subscription Details page.)

Interested parties are encouraged to consult the release site for more documentation, including technical details, PhiloLine source downloads, and a freestanding Perl module.

Find similar passages between Diderot and d'Alembert's Encyclopédie and the ARTFL-FRANTEXT database. By selecting a "Match Size" parameter, the user can further narrow the search results to look for shared passages of specific lengths.

Encyclopédie
Author: (e.g., Jaucourt)
Headword: (e.g., Courage)
Matching Text: (e.g., gloire)
Match Size: (e.g., 20-)
For example, enter 20- to match passages of 20 or more words.

 ARTFL-FRANTEXT

Author: (e.g., Montesquieu)
Title: (e.g., Histoire)
Date/Year: (e.g., 1700-1725)
Matching Text: (e.g., gloire)
Match Size: (e.g., 20-)
For example, enter 20- to match passages of 20 or more words.

Sort Results by:

  


Note: date ranges work as usual (e.g., 1700-1725). For Match Size, an open-ended value finds longer matches: 100- means 100 or more words.
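The range syntax used in the Date/Year and Match Size fields could be interpreted along these lines. This is only a sketch: the helper names are hypothetical, and the handling of a bare number (treated here as an exact value) and of closed ranges for Match Size are assumptions, since the form only documents the "1700-1725" and "20-" patterns.

```python
def parse_range(value):
    """Parse 'N-' (N or more), 'A-B' (A through B), or 'N' (assumed: exactly N)."""
    value = value.strip()
    if "-" not in value:
        n = int(value)
        return (n, n)
    low, high = value.split("-", 1)
    # An empty upper bound means the range is open-ended.
    return (int(low), int(high) if high else None)


def in_range(n, bounds):
    """Check whether n falls within the (low, high) bounds; high may be None."""
    low, high = bounds
    return n >= low and (high is None or n <= high)
```

Under these assumptions, `parse_range("20-")` yields a lower bound of 20 with no upper bound, so a 35-word match would pass `in_range` while a 12-word match would not.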

The ARTFL Project
Department of Romance Languages and Literatures
University of Chicago
1115 East 58th Street Chicago, IL 60637
tel: 773-702-8488 | email: artfl[at]artfl[dot]uchicago[dot]edu
Privacy Notice
