Coexistence and variation in numerals and writing systems

Well, it only took about 20 minutes for Dan Milton to solve the mystery of the Egyptian stamp: it has four distinct numerical notation systems on it: Western (Hindu-Arabic) numerals, Arabic numerals, Roman numerals, and most prominently but obscurely, the ‘Eye of Horus’ which served, in some instances, as fractional values in the Egyptian hieroglyphs:

egypt-stampAt the time I posted it this morning, it was the only postage stamp I knew of to contain four numerical notation systems.   (As Frédéric Grosshans quickly noted, however, a few of the stamps of the Indian state of Hyderabad from the late 19th century contain Western, Arabic, Devanagari, and Telugu numerals, and also meet that criterion, although all four of those systems are closely related to one another, whereas the Roman numerals and the Egyptian fractional numerals are not closely related to the Western or Arabic systems.    So that’s kind of neat.  I have a little collection of stamps with weird numerical systems (like Ethiopic or Brahmi), multiple numeral systems (like the above), unorthodox Roman numerals (Pot 1999), etc., and am looking to expand it, since it is a fairly delimited set and, as a pretty odd basis for a collection, isn’t going to break the bank.  In case I have any fans who are looking for a cheap present for me.  Just sayin’ …

We in the West tend to take for granted, today, that really there is only one numerical system worthy of attention, the Western or Hindu-Arabic system, which is normatively universal and standardized throughout the world.  We also tend to feel the same way about, for instance, the Gregorian calendar.   That’s a little sad but not that surprising.   But we also take it for granted that, in general, throughout history, each speech community has only one set of number words, one script, and one associated numerical notation system.    Of course, a moment’s reflection shows us this isn’t true: virtually any academic book still has its prefatory material paginated in Roman numerals, not to mention that we use Roman numerals for enumerating things we consider important or prestigious, like kings, popes, Super Bowls, and ophthalmological congresses.   And this is not to mention other systems like binary, hexadecimal, or the fascinating colour-based system for indicating the resistance value of resistors.    I’ve complained elsewhere that we put too much emphasis on comparing one system’s structure negatively against another, but to turn it around, we should ask what positive social, cognitive, or technical values are served by having multiple systems available for use.

We need to be more aware that the simultaneous use of multiple scripts, and multiple numeral systems, simultaneously in a given society is not particularly anomalous.       In Numerical Notation (Chrisomalis 2010), I structured the book system by system, rather than society by society, which helps outline the structure and history of each individual representational tradition, and to organize them into phylogenies or families.  But one of the potential pitfalls of this approach is that it de-emphasizes the coexistence of systems and their use by the same individuals at the same time by under-stressing how these are actually used, and how often they overlap.    Just as sociolinguists have increasingly recognized the value of register choice within speech communities, we ought to think about script choice (Sebba 2009) in the same way.   With numerals, we also have the choice to not use number symbols at all but instead to write them out lexically, which then raises further questions (is it two thousand thirteen or twenty thirteen?) – many languages have parallel numeral systems (Ahlers 2012; Bender and Beller 2007).   We need to get over the idea that it is natural or good or even typical for a society to have a single language with a single script and a single numerical system, because in fact that’s the exception rather than the norm.

The stamp above is a quadrilingual text (French, Arabic, Latin, Egyptian) in three scripts (Roman, Arabic, hieroglyphic) and four numerical notations (Western, Arabic, Roman, Egyptian).    We should think about the difficulty of composing and designing such a linguistically complex text – it really is impressive in its own right.  We should also reflect on the social context in which the language of a colonizer (French), the language of the populace (Arabic), and two consciously archaic languages (Latin and Egyptian), and their corresponding notations, evoke a complex history in a single text.  Once we start to become aware of the frequency of multiple languages, scripts, and numeral systems within a single social context, we have taken an important step towards analyzing social and linguistic variation in these traditions.

 

Ahlers, Jocelyn C. 2012. “Two Eights Make Sixteen Beads: Historical and Contemporary Ethnography in Language Revitalization.” International Journal of American Linguistics no. 78 (4):533-555.

Bender, Andrea, and Sieghard Beller. 2007. “Counting in Tongan: The traditional number systems and their cognitive implications.” Journal of Cognition and Culture no. 7 (3-4):3-4.

Chrisomalis, Stephen. 2010. Numerical Notation: A Comparative History.  New York: Cambridge University Press.

Pot, Hessel. 1999. “Roman numerals.” The Mathematical Intelligencer no. 21 (3):80.

Sebba, Mark. 2009. “Sociolinguistic approaches to writing systems research.” Writing Systems Research no. 1 (1):35-49.

What makes this stamp unique?: a contest

To the best of my knowledge, this Egyptian postage stamp, along with the two other denominations in the same 1937 series (5 mills and 15 mills), are unique in a very specific way.  My puzzle to you is: what makes these stamps so special?

egypt-stamp

Place your guess by commenting below (one guess per person).  If you are the respondent with the correct answer, your ‘prize’ is that you may ask me any question relating to the themes of this blog and I will write a separate post on that subject.    Happy hunting!

Edit: Well, that didn’t take long.  In just over 20 minutes, Dan Milton successfully determined the answer.  In case you still want to figure it out on your own, I won’t post the answer here in the main post, but you can find it in the comments if you’re stumped.  I will follow up with some analysis later.

Writing systems blogs: filling a gap

You may have noticed that I have (very slightly) changed the subheading for Glossographia from the former ‘Anthropology, linguistics, and prehistory’ to the new ‘Anthropology, linguistics, archaeology, and writing systems’.  I don’t actually post on prehistoric archaeology very much at all, but I do post on the archaeology of literate societies reasonably often.   And, in particular, I post on issues in epigraphy, writing systems, and literacy very often, so I thought it fitting to promote that subject to its current place of prominence.

Unfortunately, in my experience, there just aren’t that many blogs that focus on issues relating to writing systems and literacy.     Sure, each regional archaeological tradition in the ancient world has its blogs that occasionally discuss textual evidence, inscriptions, paleography, and so on – some of them, like Rollston Epigraphy, do so regularly, others sporadically.   Even then, I don’t know of any academic blogs that focus on cuneiform or on Egyptian hieroglyphs (though I’d love to be shown to be wrong).   And of course, Language Log and other general linguistics blogs do occasionally touch on written language.    But in terms of actual academic blogs that are dedicated in part or in whole to writing systems (their typology, their history, their linguistic features) or literacy (the social and cognitive context and use of writing), there isn’t that much out there.    Maya Decipherment is the most prominent, surely because of the importance of David Stuart in the field.   There’s BabelStone, where Andrew West has, very quietly, been blogging for eight years on Ogham, on Central / Inner Asian scripts, and on issues relating to typography and Unicode.  The wonderful Shady Characters focuses specifically on punctuation – check it out – it deserves a massive readership.  The Omniglot site covers writing systems in some detail, and the corresponding blog does sometimes cover issues in writing systems.   Years ago we used to have Abecedaria, which had general information on writing systems with a focus on the ancient Near East and Levant, but it’s long defunct.   Michael Everson  used to blog about the letter þorn at þorn.info, but that’s been quiet for a year or more.    And … well, that’s about all I know of or read regularly.  Do you know of any others?

So, the change in the header does not reflect an actual change in what I’ll be posting, but is simply a recognition that there really is a need for a blog with a focus on general issues on writing systems and literacy, and that since 2008, this has been one of a relatively small number of places that actually does that.

Is the Voynich Manuscript structured like written language?

This week has seen a bumper crop of news stories about a new piece of research in PLOS ONE by Marcelo Montemurro and Damian Zanette, who are both physicists who specialize in complex systems.     The paper in question is not about physics, however, but argues that the mysterious Voynich Manuscript has properties that suggest that it has language-like structure, based on an information-theoretic analysis of the structure of its words.    If correct, while this is certainly not a ‘decipherment’, this result would be counter-evidence to certain versions of the theory that the VM is a medieval hoax that is undecipherable because it is pseudo-writing, meant to have the appearance of language but having no decipherable content in any natural language.

Now, I am not a specialist in information theory, and I’m not truly a specialist on the Voynich Manuscript (although I have played one on TV), but I am a linguist and I do research on writing systems and allied representation systems like written numerals.      And several things bother me about this paper.  The first is that, as Gordon Rugg (the most significant modern proponent of a ‘hoax’ theory) has pointed out in a comment on the new paper, no one is seriously claiming that the VM is pure ‘noise’ – it clearly is structured, and simply because the VM has some structure, even one that resembles language in some ways, does not entail that it is likely to have a genuine linguistic structure, much less a decipherable one.  Rugg’s own (plausible) theory involves the use of a medieval ciphering system to rapidly produce language-like but meaningless text as part of a hoax, and Montemurro and Zanette have not evaluated this theory at all, as far as I can see, other than to dismiss it.

Furthermore, the only systems to which the VM is compared are two written languages in alphabetic scripts (English and Latin), one written language with a non-alphabetic script (Chinese), one computer language (Fortran), and one natural sequence (yeast DNA).  But there are a wide variety of nonlinguistic, quasilinguistic, and paralinguistic phenomena aside from these, and they haven’t compared the VM to any of them.   Montemurro and Zanette show conclusively that the VM has much more ‘information’ (structure) than the yeast DNA, which we would anticipate, but does not do a good job of accounting for the different types of encoded information, and structured non-information, which might be comparable to the VM.  What is the information structure of known codes and ciphers (both broken ones and undeciphered ones)?   What is the information structure of semasiographic systems like the glyphic system at Teotihuacan?   What is the information structure of the linguistic productions of psychiatric patients who suffer from graphomania?  What is the information structure of pseudo-writing like the Codex Seraphinianus which we know (since it’s a modern piece of conceptual art) carries no message?      None of these comparisons would be conclusive but all of them would be informative.   Right now the range of systems to which Montemurro and Zanette have compared the Voynich is simply too limited to be useful.

Montemurro and Zanette are also seemingly unaware of parallel efforts to use the information structure of undeciphered scripts to evaluate their language-like nature.  Two of the most significant such efforts are the effort to show that Iron Age Pictish graphic symbols from Scotland constituted a phonetic script (Lee, Jonathan and Ziman 2010) and efforts to show that the Indus script of Harappan-period India and Pakistan either does (Rao et al 2009) or does not (Farmer, Sproat and Witzel 2004) resemble linguistically-based writing systems.  These theories have attracted a reasonable degree of attention from linguists, and Richard Sproat, in particular, has done a lot of work trying to address the non-linguists’ methodological and conceptual approaches, some of which has been covered in extraordinary detail at the Language Log.    There’s a much longer discussion to be had there, but suffice it to say that most linguists are skeptical of studies undertaken without any linguistic expertise and assistance.   Again, without taking a position on any of these controversies, it strikes me as irresponsible literature-searching that the Montemurro and Zanette study is so fundamentally unaware of similar efforts in major publications such as Science and the Proceedings of the Royal Society.    If you’re going to use physics to study written language, even if you’re going to ignore every single linguist who’s written on these subjects, maybe you should at least be aware of high-impact articles written in the last ten years by physicists using very similar methods to your own.

For the record, I think that any information-based effort that does not involve linguists at a serious level is likely to make invalid assumptions and thus be highly prone to producing nice-looking gibberish.  For example, the Montemurro/Zanette theory seems to grant that the VM probably does not encode information alphabetically like English, and then suggests instead that it recalls “scripts where -as in the cases of Chinese and hierographical Ancient Egyptian- the graphical form of words directly derives from their meaning.” (Montemurro and Zanette 2013: 4).   Let’s assume we are prepared to set aside their use of the term hierographical, which is a bizarre nineteenth-century anachronism that was vaguely popular for a time prior to the decipherment of the Rosetta Stone, but which has never, in any European language, been a preferred term.   More significantly, it is a gross, entirely improper characterization of Chinese and Egyptian to argue that the “form of words directly derives from their meaning”.     Both scripts have massive phonographic components with some representation of morphemes, words, and semantic categories with signs, as every expert on writing systems has known for thirty years or more – certainly the work of John deFrancis shows this eminently clearly.    Even lumping the Egyptian hieroglyphic and Chinese scripts together in a single category ignores the massive differences between them.   So in essence, Montemurro and Zanette seem to be suggesting that the VM has properties similar to no writing system ever known to have been used on earth, because they do not seem to know what sorts of writing systems they are comparing things to.

In short, I’m afraid what we have here is another case of non-specialists applying the methods of one field inappropriately to some actually complex linguistics problems to evaluate a text whose decipherers (a group riddled with charlatans and cranks) have offered us everything except an actual decipherment.

Maya Decipherment blog / research tool

Last week, a nice article came out about Mayan epigraphy and specifically about David Stuart’s Maya Decipherment blog.    Don’t be fooled by the title – this isn’t about Zooniverse-style crowdsourced science (which has its own merits and challenges), but about the way that the blog is being used for early discussion and review of new ideas by very prominent scholars (which Stuart clearly is) – not supplanting peer review but in parallel to it.  I particularly like the article’s emphasis on the ongoing and collaborative nature of Maya epigraphy (while acknowledging that it can be an incredibly contentious field at times).    I would, however, like to propose a moratorium on the journalistic use of the word ‘mysteries’ in reference to archaeological findings.