LinguaLibre

Difference between revisions of "Events/2021 Wikimedia Wikimeet India"

< LinguaLibre:Events

 
(14 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
 +
[[File:Lingua_Libre_presentation_WMFr_for_the_Wikimedia_Meet_India_2021.pdf|thumb|Notes are available on [https://etherpad.wikimedia.org/p/wmwmd03s10 etherpad.wikimedia.org/p/wmwmd03s10]]]
 +
[[File:Lingua_Libre_-_record_your_language_-_Lingua_Libre_bot_and_recording_wizard.webm|thumb|Video]]
 
* '''Description''': Presentations on Lingualibre recording contribution, and on Github opensource assets.
 
* '''Description''': Presentations on Lingualibre recording contribution, and on Github opensource assets.
* '''Place''': online.
+
* '''Place''': Zoom > https://zoom.us/j/97614383815?pwd=OW9QazdjeVFaOTF1VXQxZ0U2ZkpCUT09
 
* '''Time''': 2021/02/19-21 > Sunday 21: 13h25 UTC/14h25 UTC+1.
 
* '''Time''': 2021/02/19-21 > Sunday 21: 13h25 UTC/14h25 UTC+1.
 
* '''Organisation''': Indian Wikipedia
 
* '''Organisation''': Indian Wikipedia
Line 6: Line 9:
 
*# [[:meta:Wikimedia_Wikimeet_India_2021/Submissions/Lingua_Libre_-_record_your_language|/Lingua_Libre_-_record_your_language]]
 
*# [[:meta:Wikimedia_Wikimeet_India_2021/Submissions/Lingua_Libre_-_record_your_language|/Lingua_Libre_-_record_your_language]]
 
*# [[:meta:Wikimedia_Wikimeet_India_2021/Submissions/Lingua_Libre_bot_and_recording_wizard|/Lingua_Libre_bot_and_recording_wizard]]
 
*# [[:meta:Wikimedia_Wikimeet_India_2021/Submissions/Lingua_Libre_bot_and_recording_wizard|/Lingua_Libre_bot_and_recording_wizard]]
* '''Participants:'''
+
* '''Participants:''' ~60 watchers, largely Indians leading contributors, movement leaders and developers.
 +
* '''Outcome:''' outreach successful, to continue.
 +
* '''Review & improvements:''' continue work on the presentation. Some proposals [[LinguaLibre:Events/2021 Wikimedia Wikimeet India#Post-event_review|here]].
 +
* '''Program:''' [[:meta:Wikimedia_Wikimeet_India_2021/Program]]
 
<noinclude>
 
<noinclude>
 
== Conference details ==
 
== Conference details ==
Line 15: Line 21:
 
# Other forms of communication: wmwm@cis-india.org (Indian Standard Time, UTC+0530 hours).
 
# Other forms of communication: wmwm@cis-india.org (Indian Standard Time, UTC+0530 hours).
  
== Lingualibre : why & how to contribute ==
+
== Presentation ==
 +
Presentation was mainly in 4 sections :
 +
* Lingualibre : why & what. By Adelaide.
 +
* Lingualibre : how to contribute / demonstration. By Adelaide.
 +
* Lingualibre Technical side : Generalities. By Yug.
 +
* Lingualibre Technical side : [[LinguaLibre:Bot]] roles and request process. By Poslovitch.
 +
* Q&A. By team.
  
== Technical presentations ==
+
'''Repositories'''
'''1. Status du Github : Need you to spread the word about language conservation'''
 
* maintenu propre
 
* 11 repositories
 
* Referees on nearly each repository
 
* JS, Python, NodeJS, PHP, MediaWiki modules,
 
* Welcome volunteer devs
 
'''2. Key repositories'''
 
 
{| class="wikitable" style="background:#FFFFDD"
 
{| class="wikitable" style="background:#FFFFDD"
 
!colspan=3|  <center>'''Github.com/Lingualibre/'''</center>
 
!colspan=3|  <center>'''Github.com/Lingualibre/'''</center>
Line 36: Line 41:
 
| LinguaRecorder  || JS, NodeJS                    || Js library controlling audio recordings
 
| LinguaRecorder  || JS, NodeJS                    || Js library controlling audio recordings
 
|-
 
|-
| QueryViz        || SparQL                        || Helps extract data, files, meaning from Lili
+
| SignIt          || JS, OOJS-UI, CSS, NodeJS      || Helps teach sign language
 
|-
 
|-
| SignIt          || JS, OOJS-UI, CSS, NodeJS      || Helps teach sign language
+
| QueryViz (not presented) || SparQL                        || Helps extract data, files, meaning from Lili
 
|-
 
|-
 
|colspan=3| <small>Notes:</small>
 
|colspan=3| <small>Notes:</small>
 
|}
 
|}
'''3. How to help LinguaLibre via tech'''
 
* Authorize LinguaLibre Bot on your wiki
 
** Brief summary of the process.
 
* Tell your FOSS community about LinguaLibre
 
* ...
 
'''4. Q&A'''
 
  
== Review and improvements ==
+
== Post-event review ==
 +
Here are possible improvements.
  
[[File:Lingua_Libre_presentation_WMFr_for_the_Wikimedia_Meet_India_2021.pdf|thumb|Notes are available on [https://etherpad.wikimedia.org/p/wmwmd03s10 etherpad.wikimedia.org/p/wmwmd03s10]]]
 
 
'''General part'''
 
'''General part'''
 
* Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text)
 
* Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text)
Line 57: Line 56:
 
'''Demonstration :'''
 
'''Demonstration :'''
 
* (Was so complete and systematic ! Merci Adélaide !)
 
* (Was so complete and systematic ! Merci Adélaide !)
* by default, the limit is <s>380 every three days</s> 380 per 72mins. for newly created accounts and normal accounts. See [[Help:Ratelimit]] for details. But! 380/hour for 4 days old accounts and an easy process to raise this significantly.
+
* by default, the limit is <s>380 every three days</s> 380 per 72mins. for newly created accounts and normal accounts. See [[LinguaLibre:User_rights#User_rights_on_LinguaLibre]] for details. But! 380/hour for 4 days old accounts and an easy process to raise this significantly.
* When you change the "stop threshold" <code>5</code> to record sentences, you have to move it '''down''' if you want longer audios.
+
* When you change the "stop threshold" <code>5</code> to record sentences, I think (?) it's the volume level under which the audio will stop. So we have to move it '''down''' if you want longer audios.
* But mainly, the "silence length" should be the single element to change.
+
* The "silence length" should be the single element to change.
 
'''Technical side :'''
 
'''Technical side :'''
 
* Add lingualibre.org/datasets/ & talk about external usages
 
* Add lingualibre.org/datasets/ & talk about external usages
 
* Remind possible usages : Wikimedia websites via bots, Language learning apps, Natural language processing (text2speech, speech2text).
 
* Remind possible usages : Wikimedia websites via bots, Language learning apps, Natural language processing (text2speech, speech2text).
 +
* Referees on nearly each repository
 +
* Welcome volunteer devs
 
* "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator.
 
* "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator.
* Do we have a long term strategy. What is our development vision (it's redundant with Adelaide request to define a long term )
+
* RecordWizard : a point on the Video recording variation.
 +
* Do we have a long term strategy. What is our development vision (it's redundant with Adelaide request to define a long term )
 +
 
 +
Also:
 +
* Improve calls for action and when we write/say them. "Come and try it out".
 +
* Visually thanks the community. (nothing in the slides)
 +
* A final slide with the Recording Wizard recording button's screenshot.
  
== Other interesting presentations ==
+
== Networking ==
 
=== Content Translation Tool ===
 
=== Content Translation Tool ===
 
Lead by the WMF's Language team. See [[:mw:Wikimedia Language engineering]], [[:meta:Category:Language research]] & [[:meta:Research:Section_Translation_Design_Research]].
 
Lead by the WMF's Language team. See [[:mw:Wikimedia Language engineering]], [[:meta:Category:Language research]] & [[:meta:Research:Section_Translation_Design_Research]].

Latest revision as of 21:46, 23 February 2021

Video

Conference details

  1. Schedule:meta:Wikimedia_Wikimeet_India_2021/Program
  2. Place: Zoom.us with pre-conference trial session.
  3. Checklist: meta:Wikimedia_Wikimeet_India_2021/Checklist
  4. Telegram group for presenters: https://t.me/joinchat/VJ78lQaYYQTDK_Dh
  5. Other forms of communication: wmwm@cis-india.org (Indian Standard Time, UTC+0530 hours).

Presentation

Presentation was mainly in 4 sections :

  • Lingualibre : why & what. By Adelaide.
  • Lingualibre : how to contribute / demonstration. By Adelaide.
  • Lingualibre Technical side : Generalities. By Yug.
  • Lingualibre Technical side : LinguaLibre:Bot roles and request process. By Poslovitch.
  • Q&A. By team.

Repositories

Github.com/Lingualibre/
Repository Technologies (Stack) Definition and impact
Lingua-Libre-Bot Python, autorization Spread 400,000 audios in your wiki
RecordWizard JS, VueJS, CSS, PHP MW-module Mediawiki UI module to record audios/video (sign language)
LinguaRecorder JS, NodeJS Js library controlling audio recordings
SignIt JS, OOJS-UI, CSS, NodeJS Helps teach sign language
QueryViz (not presented) SparQL Helps extract data, files, meaning from Lili
Notes:

Post-event review

Here are possible improvements.

General part

  • Remind possible usages : Wikimedia Websites, Language preservation, Language learning, Natural language processing (text2speech, speech2text)
  • The basics: this is a "recording project". We currently focus on words, but we can think and expand to more orality and knowledge related services.

Demonstration :

  • (Was so complete and systematic ! Merci Adélaide !)
  • by default, the limit is 380 every three days 380 per 72mins. for newly created accounts and normal accounts. See LinguaLibre:User_rights#User_rights_on_LinguaLibre for details. But! 380/hour for 4 days old accounts and an easy process to raise this significantly.
  • When you change the "stop threshold" 5 to record sentences, I think (?) it's the volume level under which the audio will stop. So we have to move it down if you want longer audios.
  • The "silence length" should be the single element to change.

Technical side :

  • Add lingualibre.org/datasets/ & talk about external usages
  • Remind possible usages : Wikimedia websites via bots, Language learning apps, Natural language processing (text2speech, speech2text).
  • Referees on nearly each repository
  • Welcome volunteer devs
  • "We think about more features and improvements that we can develop" (therefore we need devs). See Phabricator.
  • RecordWizard : a point on the Video recording variation.
  • Do we have a long term strategy. What is our development vision (it's redundant with Adelaide request to define a long term )

Also:

  • Improve calls for action and when we write/say them. "Come and try it out".
  • Visually thanks the community. (nothing in the slides)
  • A final slide with the Recording Wizard recording button's screenshot.

Networking

Content Translation Tool

Lead by the WMF's Language team. See mw:Wikimedia Language engineering, meta:Category:Language research & meta:Research:Section_Translation_Design_Research.