Help

Difference between revisions of "Querying Lingua Libre"

LinguaLibre audios are structured in a database where all the recordings and their data are described. This page expose the core structure and redirect you to the relevant documentations or help pages.

m
 
(7 intermediate revisions by the same user not shown)
Line 1: Line 1:
{{@Subtitle:LinguaLibre audios are structured in a [[wikipedia:Semantic Web|Semantic Web]] database where all the recordings and their data are described. This page expose the core structure and redirect you to the relevant documentations or help pages.}}
+
{{#Subtitle:LinguaLibre audios are structured in a [[wikipedia:Semantic Web|Semantic Web]] database where all the recordings and their data are described. This page expose the core structure and redirect you to the relevant documentations or help pages.}}
 
+
__TOC__
== Overview ==
+
== Core structures ==
[[File:LinguaLibre - Data Model Overview.svg|thumb|Overview of the data model of the Lingua Libre database]]
+
The data model includes 3 core concepts of Lingua Libre : language, speakers, recordings. Items exist for each of these 3 dimensions. Those items have [[Special:ListProperties|properties]], which themselves have various values. This, all together, creates our database.
The data model includes 3 core concepts of Lingua Libre.
 
  
 +
<center>[[File:LinguaLibre - Data Model Overview.svg|center|Overview of the data model of the Lingua Libre database|100%]]</center>
 
====Language====
 
====Language====
 
:''See also [[Help:SPARQL#Languages]], [[Special:MyLanguage/LinguaLibre:List of languages|LinguaLibre:List of languages]]''
 
:''See also [[Help:SPARQL#Languages]], [[Special:MyLanguage/LinguaLibre:List of languages|LinguaLibre:List of languages]]''
Line 20: Line 20:
  
 
== Querying the data ==
 
== Querying the data ==
'''SPARQL end points:'''
+
'''SPARQL end points'''
 
* [//lingualibre.org/bigdata/#query LinguaLibre's SPARQL endpoint] – to query Wikidata from Lingualibre, use <code><nowiki>SERVICE <https://wikidata.org/sparql></nowiki></code>.  
 
* [//lingualibre.org/bigdata/#query LinguaLibre's SPARQL endpoint] – to query Wikidata from Lingualibre, use <code><nowiki>SERVICE <https://wikidata.org/sparql></nowiki></code>.  
 
* [[d:Special:MyLanguage/Wikidata:SPARQL_query_service|Wikidata Query Service]] – to query Lingualibre from Wikidata, use <code><nowiki>SERVICE <https://lingualibre.org/sparql></nowiki></code>.
 
* [[d:Special:MyLanguage/Wikidata:SPARQL_query_service|Wikidata Query Service]] – to query Lingualibre from Wikidata, use <code><nowiki>SERVICE <https://lingualibre.org/sparql></nowiki></code>.
  
'''SPARQL helpers:'''
+
'''SPARQL helpers'''
 
* [[Special:MyLanguage/Help:SPARQL|Help:SPARQL]] – examples of SPARQL queries
 
* [[Special:MyLanguage/Help:SPARQL|Help:SPARQL]] – examples of SPARQL queries
 
** [[Special:ListProperties]] – list of all properties used on Lingua Libre
 
** [[Special:ListProperties]] – list of all properties used on Lingua Libre
** [[d:Special:MyLanguage/Wikidata:SPARQL query service/A gentle introduction to the Wikidata Query Service|Wikidata:SPARQL query service/A gentle introduction to the Wikidata Query Service]]
+
* [[Special:MyLanguage/Help:SPARQL 2|Help:SPARQL 2]] (stub) – examples of advanced SPARQL queries
  
'''API helpers:'''
+
'''API helpers'''
 
* [[Help:APIs]] – APIs queries relevant to LinguaLibre, including Commons.
 
* [[Help:APIs]] – APIs queries relevant to LinguaLibre, including Commons.
 
* [[Special:ApiSandbox]] – API queries generator for Lingualibre wikipage and wikibase contents.
 
* [[Special:ApiSandbox]] – API queries generator for Lingualibre wikipage and wikibase contents.
Line 38: Line 38:
 
== Further reading ==
 
== Further reading ==
 
* [[wikidata:Special:MyLanguage/Help:Navigating Wikidata|Help:Navigating Wikidata]] on Wikidata
 
* [[wikidata:Special:MyLanguage/Help:Navigating Wikidata|Help:Navigating Wikidata]] on Wikidata
 
+
== See also ==
 +
{{technicals}}
 
[[Category:Lingua Libre:Help]]
 
[[Category:Lingua Libre:Help]]

Latest revision as of 14:51, 20 November 2022

Core structures

The data model includes 3 core concepts of Lingua Libre : language, speakers, recordings. Items exist for each of these 3 dimensions. Those items have properties, which themselves have various values. This, all together, creates our database.

100%

Language

See also Help:SPARQL#Languages, LinguaLibre:List of languages

This is the language of a speaker or of a recording. This property may point to a language defined on Wikidata (wikidata:Q34770).

Speaker

See also Help:SPARQL#Speakers, DataViz:Speakers.

The speaker is the person that pronounced one or several words in an audio recording. There are various information on the speaker such as their residence at the time of recording and their native tongue.

Audio recordings

See also Help:SPARQL#Recordings, DataViz:Records.

Every recording created with the Record Wizard is added into the database along with some metadata that includes the date the recording was created, the speaker and the language.

Querying the data

SPARQL end points

SPARQL helpers

API helpers

  • Help:APIs – APIs queries relevant to LinguaLibre, including Commons.
  • Special:ApiSandbox – API queries generator for Lingualibre wikipage and wikibase contents.

Modifying the data

(This section needs an author.)

Further reading

See also

Lingua Libre technical helps
Template {{Speakers category}} • {{Recommended lists}} • {{To iso 639-2}} • {{To iso 639-3}} • {{Userbox-records}} • {{Bot steps}}
Audio files How to create a frequency list?Convert files formatsDenoise files with SoXRename and mass rename
Bots Help:BotsLinguaLibre:BotHelp:Log in to Lingua Libre with PywikibotLingua Libre Bot (gh) • OlafbotPamputtBotDragons Bot (gh)
MediaWiki MediaWiki: Help:Documentation opérationelle MediawikiHelp:Database structureHelp:CSSHelp:RenameHelp:OAuthLinguaLibre:User rights (rate limit) • Module:Lingua Libre record & {{Lingua Libre record}}JS scripts: MediaWiki:Common.jsLastAudios.jsSoundLibrary.jsItemsSugar.jsLexemeQueriesGenerator.js (pad) • Sparql2data.js (pad) • LanguagesGallery.js (pad) • Gadgets: Gadget-LinguaImporter.jsGadget-Demo.jsGadget-RecentNonAudio.js
Queries Help:APIsHelp:SPARQLSPARQL (intermediate) (stub) • SPARQL for lexemes (stub) • SPARQL for maintenanceLingualibre:Wikidata (stub) • Help:SPARQL (HAL)
Reuses Help:Download datasetsHelp:Embed audio in HTML
Unstable & tests Help:SPARQL/test
Categories Category:Technical reports