Help

Homographs

Revision as of 08:00, 13 January 2022 by Pamputt (talk | contribs) (+ mention of P18)

Homographs (same writing) but not homophone (not same pronunciation), aka require a suffix to differentiates these audios. The suffix should not be pronounced when recording, but it will appear in the filename. For more convenience, this suffix should be added to the List:{ISO}/{list title} you plan to record, before recording it.

Euler diagram showing the relationships between heteronyms and related linguistic concepts.


Rules

  1. If one pronunciation is clearly the norm, no suffix is needed.
  2. For equal rank or rare pronunciations, add to that word a suffix within brackets, example:
    # word (suffix).
  3. This suffix should hint at the difference between two homographs or more.
  4. The suffix must be consistent and stable, ex: if you start with (noun), (verb), keep that exact convention for all your recordings. If you start with a transcription, keep on that transcription. Etc.
  5. The suffix is in the same language as the word, ex : red (noun), အနီရောင် (နာမ်).
  6. Abbreviations can be used if and only if well established in the general public, ex: adj., v., n.

Homographs homophones

Given one language and one speaker, one recording for them all. Even if meaning diverges.

Homographs non-homophones

The following are homographs non-homophones, the part between brackets is not read aloud in LinguaLibre but is used to distinguish those recordings.

Distinction via semantic synonyms :

  • # crooked (injured), pronounced and recorded `crookaid` /ˈkrʊkɪd/
  • # crooked (corrupt), pronounced and recorded `crookt` /ˈkrʊkt/

Distinction via pronunciation using IPA:

  • # crooked (/ˈkrʊkɪd/), pronounced and recorded `crookaid` /ˈkrʊkɪd/
  • # crooked (/ˈkrʊkt/), pronounced and recorded `crookt` /ˈkrʊkt/

Distinction via pronunciation using toned Hanyu pinyin:

  • # 雨 (yǚ), pronounced and recorded `/yː3/`
  • # 雨 (yù), pronounced and recorded `/y:4/`

Distinction via the part of speech :

  • # excellent (v), pronounced and recorded `excel`
  • # excellent (adj), pronounced and recorded `excellant`

In some language, word can be pronounced and recorded differently if read by a man or woman :

  • # vert (masculin), pronounced and recorded `ver`
  • # vert (féminin), pronounced and recorded `verte`

In practice

Within your list such as List:mnw/Commons, transform :

# ကစေံ1
# ကစေံ2
# ကစေံ3
# ကစေံ4

into

# ကစေံ (read)
# ကစေံ (speak)
# ကစေံ (Tang)
# ကစေံ (Te)

You can now record your words, without reading the suffix.

Technical details

The suffix is not part of the word and is stored with the property qualifier (P18) in the Wikibase. See fils (enfant) (Q1686) and fils (pluriel de fil) (Q1685) for example. It is then possible to query recordings without mixing words and suffixes.

See also