Pangloss Collection

From Wikipedia, the free encyclopedia

The Pangloss Collection is a digital library whose objective is to store and facilitate access to audio recordings in endangered languages of the world. Developed by the LACITO centre of CNRS in Paris, the collection provides free online access to recordings of connected, spontaneous speech in otherwise poorly documented languages from all continents.

As of mid-2013, the Pangloss archive contained data from 70 languages.

Principles

A sound archive with synchronized transcriptions

For linguistic science, language is first and foremost spoken language, and the medium of spoken language is sound. The Pangloss Collection gives access to the original recordings alongside synchronized transcriptions and translations, as a resource for further research. After being recorded in their cultural context, the texts are transcribed in collaboration with native speakers.
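
As an illustration only, synchronization between sound and text can be pictured as a list of time-aligned segments, each pairing a stretch of the recording with its transcription and translation. The sketch below is a minimal Python example under that assumption; the `Segment` class, its field names, the `segment_at` helper, and the placeholder strings are hypothetical and do not reproduce the collection's actual data format.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One time-aligned unit of a recording (hypothetical layout)."""
    start: float        # offset into the audio, in seconds
    end: float          # end offset, in seconds
    transcription: str  # text in the recorded language
    translation: str    # free translation


def segment_at(segments, t):
    """Return the segment covering playback time t, or None.

    Assumes segments are sorted by start time and do not overlap.
    """
    starts = [s.start for s in segments]
    i = bisect_right(starts, t) - 1  # last segment starting at or before t
    if i >= 0 and t < segments[i].end:
        return segments[i]
    return None


# Placeholder content, not real data from the archive.
segments = [
    Segment(0.0, 2.5, "first transcribed sentence", "first translation"),
    Segment(2.5, 6.0, "second transcribed sentence", "second translation"),
]
```

With such a structure, a player can look up the text to display for any playback position, for example `segment_at(segments, 3.0)`, and a reader can jump from a sentence to the matching stretch of audio through its `start` offset.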

A structured, open architecture

The archived data is structured in accordance with the latest data-processing standards, stored in an open format, and may be downloaded under a Creative Commons license. The software used to prepare and disseminate the data is open source. The Pangloss Collection is a member of the Open Language Archives Community (OLAC) network of archival repositories.

This article is issued from Wikipedia. The text is available under the Creative Commons Attribution/Share Alike license; additional terms may apply for the media files.