Querying Lingua Libre

Revision as of 21:33, 12 January 2022 by Yug

Lingua Libre audio recordings are described in a structured database that holds every recording together with its metadata. This page presents the core structure and points you to the relevant documentation and help pages.

Core structures

The data model is built around three core concepts of Lingua Libre: languages, speakers, and recordings. A recording item (identified by a Qid) is a data entry that ties these dimensions together.

Language

See also Help:SPARQL#Languages, LinguaLibre:List of languages.

This is the language of a speaker or of a recording. This property may point to a language defined on Wikidata (wikidata:Q34770).

Speaker

See also Help:SPARQL#Speakers, DataViz:Speakers.

The speaker is the person who pronounced one or several words in an audio recording. Various information is stored about each speaker, such as their place of residence at the time of recording and their native language.

Audio recordings

See also Help:SPARQL#Recordings, DataViz:Records.

Every recording created with the Record Wizard is added to the database along with metadata, including the date the recording was created, the speaker, and the language.
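To make the relationships between the three concepts concrete, here is a minimal Python sketch of the data model as described on this page. The class names, field names, and identifiers are illustrative assumptions and do not reflect the actual Wikibase property names used by Lingua Libre.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Language:
    qid: str                            # Lingua Libre item id (hypothetical field name)
    wikidata_id: Optional[str] = None   # optional link to a language defined on Wikidata


@dataclass(frozen=True)
class Speaker:
    qid: str
    residence: Optional[str] = None              # residence at the time of recording
    native_language: Optional[Language] = None   # the speaker's native language


@dataclass(frozen=True)
class Recording:
    qid: str
    created: date        # date the recording was created
    speaker: Speaker     # who pronounced the word(s)
    language: Language   # language of the recording


# All identifiers below are made-up placeholders, not real Lingua Libre items.
language = Language(qid="Q-LANG", wikidata_id="Q-WIKIDATA")
speaker = Speaker(qid="Q-SPEAKER", residence="Paris", native_language=language)
recording = Recording(qid="Q-RECORD", created=date(2022, 1, 12),
                      speaker=speaker, language=language)
```

In this sketch a recording always carries a speaker and a language, while the speaker's residence and native language are optional, since the page only lists them as examples of available information.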

Querying the data

SPARQL endpoints
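As an illustration of how a SPARQL endpoint can be queried from a script, the sketch below builds a query listing recordings with their speaker and language, then prepares an HTTP request following the SPARQL 1.1 Protocol. The endpoint URL, the `prop:` and `entity:` prefixes, and the identifiers P2, Q2, P4, and P5 are assumptions made for this example; check Help:SPARQL for the authoritative values. The request is only prepared, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Assumed endpoint URL; verify it against Help:SPARQL before use.
SPARQL_ENDPOINT = "https://lingualibre.org/bigdata/namespace/wdq/sparql"


def build_recordings_query(instance_of="prop:P2", record_class="entity:Q2",
                           speaker_prop="prop:P5", language_prop="prop:P4",
                           limit=10):
    """Return a SPARQL query listing recordings with speaker and language.

    The default identifiers are assumptions, and the prefixes are assumed
    to be predefined by the endpoint.
    """
    return (
        "SELECT ?record ?speaker ?language WHERE {\n"
        f"  ?record {instance_of} {record_class} ;\n"
        f"          {speaker_prop} ?speaker ;\n"
        f"          {language_prop} ?language .\n"
        "}\n"
        f"LIMIT {int(limit)}"
    )


def build_sparql_request(query, endpoint=SPARQL_ENDPOINT):
    """Prepare (but do not send) a GET request asking for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


query = build_recordings_query(limit=5)
request = build_sparql_request(query)
```

To actually run the query, pass `request` to `urllib.request.urlopen` and decode the response body with `json.loads`. Keeping the query construction separate from the network call makes it easy to inspect the generated query first.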

SPARQL helpers

API helpers

  • Help:APIs – API queries relevant to Lingua Libre, including Commons.
  • Special:ApiSandbox – API query generator for Lingua Libre wiki pages and Wikibase content.
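As a rough illustration of the kind of request these helpers generate, the sketch below builds a URL for the standard Wikibase `wbgetentities` API module. The `api.php` location and the availability of this module on Lingua Libre are assumptions, and the item ids are placeholders; Help:APIs and Special:ApiSandbox remain the reference.

```python
from urllib.parse import urlencode

# Assumed location of the MediaWiki API on Lingua Libre; see Help:APIs.
API_ENDPOINT = "https://lingualibre.org/api.php"


def build_entity_url(item_ids, endpoint=API_ENDPOINT):
    """Return a URL asking the Wikibase API for the given items as JSON."""
    params = {
        "action": "wbgetentities",   # standard Wikibase API module
        "ids": "|".join(item_ids),   # several ids are separated by "|"
        "format": "json",
    }
    return endpoint + "?" + urlencode(params)


# "Q1" and "Q2" are placeholders for whichever items you want to inspect.
url = build_entity_url(["Q1", "Q2"])
```

The resulting URL can be opened in a browser or fetched with any HTTP client. The same parameters can be entered in Special:ApiSandbox to explore the response interactively.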

Modifying the data

(This section needs an author.)

Further reading