Mining eighteenth century ontologies: Machine learning and knowledge classification in the Encyclopédie

dc.contributor.author: Horton, Russell
dc.contributor.author: Morrissey, Robert
dc.contributor.author: Olsen, Mark
dc.contributor.author: Roe, Glenn
dc.contributor.author: Voyer, Robert
dc.date.accessioned: 2014-10-02T06:17:30Z
dc.date.available: 2014-10-02T06:17:30Z
dc.date.issued: 2009
dc.date.updated: 2015-12-08T10:45:47Z
dc.description.abstract: The Encyclopédie of Denis Diderot and Jean le Rond d'Alembert was one of the most important and revolutionary intellectual products of the French Enlightenment. Mobilizing many of the great – and the not-so-great – philosophes of the 18th century, the Encyclopédie was a massive reference work for the arts and sciences, which sought to organize and transmit the totality of human knowledge while at the same time serving as a vehicle for critical thinking. In its digital form, it is a highly structured corpus; some 55,000 of its 77,000 articles were labeled with classes of knowledge by the editors, making it a perfect sandbox for experiments with supervised learning algorithms. In this study, we train a Naive Bayesian classifier on the labeled articles and use this model to determine class membership for the remaining articles. This model is then used to make binary comparisons between labeled texts from different classes in an effort to extract the most important features in terms of class distinction. Reapplying the model to the original classified articles leads us to question our previous assumptions about the consistency and coherency of the ontology developed by the Encyclopedists. Finally, by applying this model to another corpus from 18th-century France, the Journal de Trévoux, or Mémoires pour l'Histoire des Sciences & des Beaux-Arts, new light is shed on the domain of Literature as it was understood and defined by 18th-century writers.
dc.format: 15 pages
dc.identifier.issn: 1938-4122
dc.identifier.uri: http://hdl.handle.net/1885/12101
dc.publisher: The Alliance of Digital Humanities Organizations
dc.rights: http://www.digitalhumanities.org/dhq/vol/3/2/000044/000044.html "This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License: http://creativecommons.org/licenses/by-nc-nd/3.0/" (from the publisher's website as at 1/10/2014)
dc.source: Digital Humanities Quarterly 3.2 (2009)
dc.source.uri: http://www.digitalhumanities.org/dhq/vol/3/2/000044/000044.html (en_AU)
dc.subject: machine learning
dc.subject: encyclopedie
dc.subject: digital humanities
dc.title: Mining eighteenth century ontologies: Machine learning and knowledge classification in the Encyclopédie
dc.type: Journal article
local.bibliographicCitation.issue: 2
local.bibliographicCitation.lastpage: 15
local.bibliographicCitation.startpage: 1
local.contributor.affiliation: Roe, G, ANU Research School of Humanities & the Arts (en_AU)
local.contributor.authoruid: u5455391 (en_AU)
local.identifier.absfor: 200511 - Literature in French
local.identifier.absfor: 080109 - Pattern Recognition and Data Mining
local.identifier.ariespublication: u4486421xPUB149
local.identifier.citationvolume: 3
local.publisher.url: http://adho.org/ (en_AU)
local.type.status: Published Version (en_AU)

Downloads

Original bundle

Name: Roe_MiningEighteenth_2009.pdf
Size: 473.6 KB
Format: Adobe Portable Document Format