Variant Roman numerals: a project

Yesterday I thought of a great new project that could be a nice little article, or, if I had a grad student with a background in classical archaeology, as a nice little thesis, or, if someone else wants to work with me, a co-authored paper. Heck, if you scam my idea, more power to you – I will cite you widely if it’s good, and mock you widely if not! You see, the Epigraphische Datenbank Clauss-Slaby is a searchable full-text database with over 350,000 Latin inscriptions (including over 20,000 images). You can enter a word (e.g, Germaniae) and it returns all the inscriptions that have that word. Nifty, huh? Just in mucking about with EDCS today I discovered two or three things that will be coming out in my book that are in need of revision, which makes me only a little bitter.

Now of course I’m not a classicist (I have three terms of Latin under my belt, but that’s hardly enough to make me an expert), but I do know a thing or three about Roman numerals. The study of Roman numerals is sorely neglected in modern epigraphy, which is a shame because there are some really interesting social questions to be asked relating to regional identity and literacy (the sort of stuff, e.g., that Greg Woolf does). We think that we know Roman numerals: just take I, V, X, L, C, D, and M, string them together in groups of no more than three, use subtractive notation for numbers like 9 and 44, and you’re done. But it isn’t so simple.

The Roman numerals are not a static and unified system; there are various expressions for the same number (e.g. XVIII vs. XIIX for 18, or XXXXX vs. L for 50). Back in the 1950s, Arthur and Joyce Gordon did some interesting statistical analysis, indicating some potential sources of this variability (chronological, regional, and textual), but he didn’t have the sort of massive resources that the EDCS provides. So, for instance, it is often said that IIIII for 5, XXXXX for 50, and CCCCC for 500 (i.e., not using the sub-base signs V, L, and D) are particularly found in African inscriptions. Well, a quick search for ‘CCCCC’ and ‘XXXXX’ suggest to me that this isn’t a full explanation. Are certain types of inscription more likely to contain these variants? Could we be dealing with a chronological difference? Could we be dealing with a variant typical of minimally literate writers, or writers of informal texts? Or could it be that the shorter forms are used when there’s less room on the medium, with longer variants used when space is not at a premium? I have no idea, but the only way to find out would be to build a list of inscriptions that use these variants, map them in time and space, and evaluate them in terms of the texts in which they occur.

Now, there are some methodological complexities: some of the interesting variation is between different forms for the same character, and there is no way to search for that. Some of the Roman numeral forms (the use of a horizontal bar or vinculum over a numeral to indicate multiplication by 1000) aren’t represented consistently, or at all, so one would just need to rely on other published material to find the relevant inscriptions. And quite a lot of the project would require taking the database results and then referring to the Corpus Inscriptionum Latinarum. But ultimately it would be taking what seems to be a rather dry subject (variability in Roman numerals) and potentially correlating it with variability in social identities (class, ethnic, professional). Well, I think it’s cool, anyway.

A typology of quotation marks

I’ve been thinking about quotation marks lately (okay, now I’ve lost 99% of my readership already, way to go, Steve!) and the different ways we use them. Because I have a strong interest in literacy and culture and the way in which language gets turned into text, these sorts of things excite me in a way that is probably not entirely healthy, but then again, if I wasn’t, you wouldn’t have a post to read. So without further delay, I give you…

“A” “typology” “of” “quotation” “marks”

Quotative: This is the common case in which quotation marks serve to distinguish matter spoken or written in another context, with the presumption that the quoted matter is being reproduced somewhat faithfully. The material may have been spoken or written originally, but there is a much higher expectation of word-for-word reproduction when quoting written material, for the obvious reason that the writer can copy from a written source. This was the original, and remains the most common sense of quotation marks in printed matter. It helps us to distinguish sentences like

Martha said, “Canada is a fascist dictatorship.”

from

Martha said that Canada is a fascist dictatorship.

In the first case we are clearly meant to understand that Martha spoke those words, where in the second Martha might well have said, “Our government is heading towards fascism” or any number of other things.

Neologistic: Quotation marks are frequently used when an author coins a neologism, or coins a phrase using already existing words. One is not quoting some earlier source directly; one is seeking instead to indicate the novelty of the term being used. So, for instance, in this post, I write:

The effect of this ‘conspicuous computation’ was to impress the reader with the vastness of the quantity, serving as an indexical sign of Rome’s military might.

I’m not quoting myself here – I’m coining a new phrase and using quotation marks to alert the reader to this fact. We get into trickier ground when we put quotation marks around a single, existing word that we intend to use in a new sense, as in the following passage:

Let me explain first what I understand by “sociolinguistic”. I use the term in its adjectival form and speak of “sociolinguistic” kinds of research rather than “sociolinguistics”. (Hymes 1971: 42)

The context strongly suggests neologism, but another reading is that Dell Hymes (the author, a renowned sociolinguist / linguistic anthropologist) is seeking to dismantle the entire concept of sociolinguistics, or at least to shift its meaning substantially in this context. If so, we’re dealing with another sense entirely.

Distancing: The quotation marks serve to distance the author from the matter in quotes, but where that matter is not a faithful reproduction of other matter. One finds these very often in the titles of British newspaper articles, possibly because British libel laws are very strict and one could find oneself liable for making a statement that is not a direct quotation of another source but which is also not hard fact. They frequently have a quotative smell to them, insofar as they often relate to assertions or claims by another party, but in fact they are not quotative at all, and often appear to be paraphrases at best. I posted about this elsewhere a couple of years ago, and I still find this use jarring. An example:

‘Many killed’ in Yemen air raid

The BBC is not trying to say that someone wrote or said the words “many killed” in that order or even that the quote is an abbreviation of “many people were killed”. It is reporting on others’ claims, true, but the purpose is not quotative. We can think of the distancing quotes as being quotative minus the condition of (near-)faithfulness.

Ironic: Ironic quotation marks often also distance the author from the words written, but more importantly, distance the meaning of the quoted matter from its standard or accepted one. These are often called “scare quotes” by academics, a term which I find bothersome because they aren’t meant to scare anyone. I am indebted to my colleague Jacalyn Harden who came up with the metaphor of quotation marks as eyelashes – ironic quotes serve as a textual “wink” alerting the reader that some novel sense is intended. Wikipedia uses an example from my late mentor, Bruce Trigger:

Moctezuma II was reported to have had two wives and many concubines, by whom he had a total of 150 children. The king of Texcoco was said to have had more than two thousand “wives” by whom he had had 144 children, 11 born of his chief wife. (Trigger 2003: 178)

So we understand here that two thousand “wives” in the second sentence is not to be understood in the same sense as two wives in the first sentence. In both cases, a ruler is claimed (“reported” vs. “said”) to have some number of wives, so we can tell that the difference is not due to the quotative vs. non-quotative distinction. Because I knew the author of those words and worked on that very book, that I do not think that Trigger meant them deconstructively (see below) or in any other sense. Rather, it is because having two wives is not at all uncommon (even having two wives simultaneously is hardly a historical anomaly), but having two thousand wives strains credulity: the semantic associations we derive from the word wife could never be extended to the relationship between one man and two thousand women.

Deconstructive: There are scare quotes, and then there are “scare quotes”, and these are the latter. Where ironic quotes use the word in a different sense than that intended, deconstructive quotes imply that the object being quote-marked does not in fact exist. So, for instance, when one talks about “race” as opposed to race, one is noting that there is no biological reality to the race concept. Perhaps the most fantastic and potentially incomprehensible example is the following, from the linguistic anthropologist Michael Silverstein:

The important fact, then, is that “I” am to a certain extent what “I” say about “what” “I” drink as much as what “I” say about “it” reflects what “I” can discern about “what” “it” is. (Silverstein 2006)

In Trigger’s example above, he cannot mean that wives do not exist at all – he explicitly rejects this by his use of the un-quote-marked word in the previous sentence, and the un-quote-marked word wife in the second sentence as well. There are wives, and then there are “wives”. But in Silverstein’s example, he is really saying that “I” and “what” and “it” (the latter two referring to ‘that which I drink’) do not exist as real entities – they are socially constructed, to use one well-understood if less-than-ideal term. In ironic quotation marks, “A” is not A, but B, while in deconstructive ones, “A” is not A and is not anything else either.

Emphatic: The quotation marks serve as visual emphasis alone, and are not meant in an ironic, distancing, or quotative function. Most writers, I suspect, would treat this usage as an error, but it is widespread enough to deserve our attention. It is most frequently found on mercantile and informational signs, especially handmade ones. I refer you to The “Blog” of “Unnecessary” Quotation Marks, which gives such great examples as:

Closed “Monday”

“Fire Exit”
Please Do Not Use
Alarm Is On

and my personal favourite:

Thank “God” For All the Troops

These can clearly be excluded from the five other categories. Instead, the quotation marks serve as a sort of typographic highlighter, a means of emphasizing some words in the text. This is confirmed by the contextual association of emphatic quotes with billboards, signs, placards, and other texts meant for wide public visibility, and by the fact that many of the quote-marked words are also emphasized in some other way: boldface, underlined, capitalized, or in larger letters than the rest of the text. Are they truly “unnecessary”? Yes, in the sense that there are other ways to emphasize text, and because this sense is non-standard, some humor derives from understanding emphatic quotes as meaning something else (usually ironic). For instance, take this discussion at the unnecessary quotations blog over the sign Sellersburg Welcomes “President” George W. Bush. Sly jab at perceived electoral fraud, or over-ebullient semantic extension of well-known punctuation? You decide.

It would be very interesting to expand this analysis to specify more clearly the “etymology” (ironic) of each of the six forms and then to examine the historical and semantic relations among them. For instance, I suspect that the quotative and neologistic usages are earliest but that the broad semantic aspect of distance is what unifies all the senses except the emphatic. I also think one could do some very interesting corpus linguistics using students to code instances of quotation reliably, both in terms of frequency in different texts and in terms of this semantic typology. Finally, I haven’t even discussed the use of single versus double quotes (which could have some interesting correlations with my typology), or talked about “embodied” (neologistic) quotation marks in the form of “air quotes” (quotative?). Well anyway, if I write the paper, it’ll give me something to “talk” (ironic) about.

Works cited
Hymes, Dell. 1971. The Contribution of Folklore to Sociolinguistic Research. The Journal of American Folklore 84, no. 331 (March): 42-50.
Silverstein, Michael. 2006. Old wine, new ethnographic lexicography. Annual review of anthropology 35: 481-496.
Trigger, Bruce G. 2003. Understanding early civilizations: a comparative study. Cambridge Univ Press.

Ysteriousmay esselvay

Archaeologists working at Mount Zion in Israel have uncovered a stone vessel (dated between 37 BCE and 70 CE by archaeological association) bearing a cryptic script. The vessel, which is around 13 cm high, bears ten lines of writing scratched into the stone, but the inscription, which appears to be a mixture of Hebrew and Aramaic (we are not told on what grounds), but has not yet been deciphered.

Now that’s interesting. Let’s think about that for a minute. The article claims that “the cup’s script appears to be a secret code, written in a mixture of Hebrew and Aramaic, the two written languages used in Jerusalem at the time”. Both Hebrew and Aramaic are well-understood scripts that represent well-understood languages. So what on earth is going on here?

We don’t have a transcription at all, and I’ve been unable to find any further information yet, but what I suspect is going on is that some of the characters on the vessel are in Hebrew script, and others in Aramaic script, but that the translation of these doesn’t yet reveal a meaningful interpretation in either the Hebrew or Aramaic languages. It’s possible that the inscription records a third language (oh, let’s say … Phrygian, just to be obscure yet controversial), or that it is some sort of linguistic code (think ‘Pig Aramaic’, if you will) or even a substitution cipher. We just don’t know what the language is yet – and that’s really the most likely way we could get a text like this in two well-known scripts that we can’t read.

This is often a problem in media reporting in paleography and epigraphy – no distinction is made between the writing system (script) and the language of the inscription. The good news is that the research team is “sharing pictures of the cup with experts on the writing of the period. The researchers also plan to post detailed photos of the cup and its inscriptions online soon.” Now that’s going to make progress a lot faster.

Levantine hieroglyphs in the Early Bronze Age

A 4cm fragment of a carved stone plaque (photo here) has been found in northern Israel at the site of Tel Bet Yerah, depicting an arm bearing a scepter and an early form of the Egyptian ankh symbol. It appears to date to the First Dynasty of Egypt (ca. 3000 BCE) – by some centuries, this is the earliest evidence of Egyptian writing outside of Egypt proper – although it isn’t a text that could be understood linguistically, and in fact is very small. The press release from Tel Aviv University isn’t clear how it was dated (whether contextually by association with other material, or paleographically/iconographically from the style of the inscription), but notes that it is “the first artifact of its type ever found in an archaeological context outside Egypt”, whatever ‘of its type’ means. Either way, this is strong evidence that Egyptian representational traditions were known in the Levant in the Early Bronze Age, 1500 years before the New Kingdom, when Egypt first exercised direct political authority in the region.

Ajami and Western numerals

Pardon me as I sort out my long list of posts that got shelved prematurely this summer during my fieldwork. There is a really neat little article entitled Lost Language in Bostonia, the Boston University alumni magazine. It’s a fascinating look at the research of Fallou Ngom, who specializes in Ajami writing. Ajami is the name given collectively to modified versions of the Arabic script used to write various West African (non-Arabic) languages. Once you set aside the ridiculous title of the article – Ajami is only ‘lost’ in the ethnocentric sense that most Western scholars don’t know about it and is most definitively not a language, but rather a set of writing systems, each used to write a specific language – it’s an interesting look at a neglected subject relating to an area that is often misperceived as illiterate and having made no contributions to intellectual life.

But what interested me most about the article were the two photos of Ajami manuscripts – one right at the top of the article, another around two-thirds of the way down. And while, yes, I may be the only person to find this really striking, but both of the pages are numbered using Western numerals (51 and 7, respectively). In virtually any handwritten and printed Arabic literature, the set of Arabic numerals ٠١٢٣٤٥٦٧٨٩ are used, not the Western numerals 0123456789. In fact, these numerals have been one of the most resistant to being replaced by Western numerals, even as other regions of the world, such as Japan and India, have partially or fully abandoned their traditional numbering systems. That these Ajami texts are paginated in Western numerals is thus notable, and raises the question of how widely this practice has spread. Is it just chance that these two texts selected happened to use Western numerals, or is this a systematic difference between Arabic and Ajami texts? And if it is a real difference, when and in what context(s) did it emerge? Sounds like a good project for a master’s thesis.