Logarithmical: Zipf’s Law and the mathematics of punctuation

Let’s try an experiment. If we start with some large body of text — post-war American novels, say, or twentieth-century British newspapers — and count all the occurrences of all the words in those texts, we can put together a fairly accurate list of the most popular words in English. The word “the” would be at the top, followed by “of” and then “and”. With this list of word counts in hand, you could turn to any other similar body of work — British novels or American newspapers, for example — and have a good idea of how often you’d expect to find each of the words on your list. Simple enough.

Read more →

The imminent death of the paragraph

As I mentioned last time, I recently took part in a workshop on the subject of “punctuation in practice”. My presentation there was titled “Ghostwritten: the vanishing pilcrow”, and it traced the life of the paragraph mark from ancient marginal dash (—) to medieval capitulum (¢) to pilcrow (¶), as shown in the slide reproduced above, and finally to empty space, or paragraph indent ( ) — all things I’ve talked about here, and in the Shady Characters book, at some length.

Read more →

Miscellany № 62: the absence of punctuation

Just the one punctuation-related link this week: Shady Characters is upping sticks and moving to London this coming week, and blogging time is scarce!


So: to Canada, where the National Post recently reported on a PhD thesis that contains no conventional punctuation.1 Submitted to the University of British Columbia by architect Patrick Stewart, a member of the Nisga’a First Nation, each of the chapters of Stewart’s dissertation opens with a summary written in standard academic English but the bulk of the work is presented without uppercase letters, full stops or commas. Stewart holds a select few marks in reserve for more troublesome concepts — a forward slash “connects words of similar meaning / emphasis”; an ellipsis “indicates a continuity of thought”; and the question mark survives intact — but, on the whole, his thesis is, as he describes it, “one long, run-on sentence, from cover to cover”.

Read more →