Search Results for “decipherment”

Source: Decipherment

In philology, decipherment is the discovery of the meaning of the symbols found in extinct languages and/or alphabets.
Decipherment overlaps with another technical field known as cryptanalysis, a field that aims to decipher writings used in secret communication, known as ciphertext. A famous case of this was in the cryptanalysis of the Enigma during the World War II. Many other ciphers from past wars have only recently been cracked. Unlike in language decipherment, however, actors using ciphertext intentionally lay obstacles to prevent outsiders from uncovering the meaning of the communication system.
Today, at least a dozen languages remain undeciphered. A notable recent decipherment was that of the Linear Elamite script.

Methods

= External information

decipherment

= Internal information

decipherment

= Computational approaches

decipherment

= Artificial intelligence

decipherment

Deciphering pronunciation

: "In the long time it naturally soundeth sharp, and high; as in chósen, hósen, hóly, fólly [. . .] In the short time more flat, and a kin to u; as còsen, dòsen, mòther, bròther, lòve, pròve". Another example comes from detailed comments on pronunciations of Sanskrit from the surviving works of Sanskrit grammarians.

Challenges

Many challenges exist in the decipherment of languages, including when:

When it is not known which language is closest to it.
When the words in the script are not clearly segmented, like in some Iberian languages.
When the writing system is not known. In specific, if there is little certainty towards the number of graphemes that exist in a certain writing system, it cannot be determined if that system is an alphabet, a syllabry, a logosyllabry, or something else.
When the reading direction is not known. For example, it may not be clear if a writing system is meant to be read from left to right, or from right to left.
When it is not known if a script uses punctuation or spaces between words.
When the language of a script subject to decipherment efforts is not known.
When there is a small dataset available to learn about the properties of a script. This could lead to issues such as an incomplete vocabulary being known for the script.
When the typical order between subjects, objects, and verbs is not known.
When it is not known whether or how certain words can change their form.
When it is not known when multiple symbols are used to represent the same sound, syllable, word, concept, or idea (allographs).
When it is not clear how the penmanship or the style of writing of a particular scribe relates to the style of writing of another scribe working in the same text (the same letters or words might be written in a way that looks different), in which case it is difficult to correlate information across multiple examples of the use of the writing system.
When it is not known if certain words change their meaning depending on the context they appear in (homonyms).
When the context of discovery of a writing is not known. This is because information about the location out of which a writing system came from can provide valuable information about its relationship to known languages.
When adequate digital datasets for documented writing systems is not available, limiting the ability to use computational methods for decipherment.
When sufficient hardware resources, such as high performance computing, is not available (which might be necessary for more computationally intensive computational methods).

Notable decipherers

= Deciphered scripts

=
Cuneiform
Egyptian hieroglyphs
Kharoshthi
Linear B
Mayan
Staveless Runes
Cypriot Syllabary

= Undeciphered scripts

=
Rongorongo (Decipherment of rongorongo)
Indus script
Cretan hieroglyphs
Byblos syllabary
Linear A
Linear Elamite
Cypro-Minoan syllabary
Espanca
Numidian language