Cross-Linguistic Data Formats, advancing data sharing and re-use in comparative linguistics

Forkel, Robert; List, Johann Mattis; Greenhill, Simon; Rzymski, Christoph; Bank, Sebastian; Cysouw, Michael; Hammarström, Harald; Haspelmath, Martin; Kaiping, Gereon; Gray, Russell D.

Cross-Linguistic Data Formats, advancing data sharing and re-use in comparative linguistics

Date

2018-10-16

Authors

Forkel, Robert

List, Johann Mattis

Greenhill, Simon

Rzymski, Christoph

Bank, Sebastian

Cysouw, Michael

Hammarström, Harald

Haspelmath, Martin

Kaiping, Gereon

Gray, Russell D.

Publisher

Nature Publishing Group

Abstract

The amount of available digital data for the languages of the world is constantly increasing. Unfortunately, most of the digital data are provided in a large variety of formats and therefore not amenable for comparison and re-use. The Cross-Linguistic Data Formats initiative proposes new standards for two basic types of data in historical and typological language comparison (word lists, structural datasets) and a framework to incorporate more data types (e.g. parallel texts, and dictionaries). The new specification for cross-linguistic data formats comes along with a software package for validation and manipulation, a basic ontology which links to more general frameworks, and usage examples of best practices.

URI

http://hdl.handle.net/1885/186596

Collections

ANU Research Publications

Source

Scientific Data

Type

Journal article

Access Statement

Open Access

License Rights

Creative Commons Attribution 4.0 International License

DOI

10.1038/sdata.2018.205

Downloads

File

Description

01_Forkel_Cross-Linguistic_Data_Formats%2C_2018.pdf (876.69 KB)

Full item page

Cultural advice

Cross-Linguistic Data Formats, advancing data sharing and re-use in comparative linguistics

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads