Idiosophy

A physicist loose among the liberal arts

Author: Joe Page 29 of 32

Site manager

Pitfalls of Scientific Analysis

I’ve mentioned my qualms about bringing science to bear on the world-building of a fantasy author.  The Web provides a brilliant example of what I’m talking about.  Lyman Stone knows about demographics and geography, and tries to apply them to Westeros, from A Song of Ice and Fire. It doesn’t go well for George R.R. Martin (a.k.a. “Railroad”, many years ago).

Martin is interested in the contention between noble houses, and contorts the world around the aristocrats as necessary to set up the scenes he has in mind.  He hasn’t made any effort to ensure that there are sufficient agricultural populations to support the cities, or that the ethnic diversity of the population matches the speed of transportation and communications.

And that’s OK. The book he’s writing doesn’t need all that. But you have to be careful — you can’t increase your appreciation for a book by walking around the backdrops and looking at the hastily-nailed lumber and spilled paint.  You have to stay on stage.  Or, as C. S. Lewis put it in Meditations in a Toolshed, you have to look along the story, not at it.

Elegantly put, Mr. Lewis.  Now, how do we define a coordinate system so I can  place scientific disciplines on the proper axes with respect to a work of fiction?


P.S.  Some ASoIaF fans who don’t read very closely laid into Mr. Stone’s analysis, and were duly smacked down in a follow-up post.

Patrons of the Humanities

Writing from his perch above the Ice Bay of Forochel, Brenton has a proposal for ordinary people to sponsor humanities scholars, with expenditures beginning at zero dollars. As a good blog post should, it provoked a swift series of associations in my mind.

  1. Cool! I can be a Renaissance prince!
  2. No, wait, I would hate that.
  3. This is what Patreon and other crowdfunding sites are for. This is more personal, but the dollar figures will be much smaller.
  4. Thinking back to my days as an impecunious scholar, some of these would embarrass the daylight out of me.
  5. In a modern liberal democracy, we’re supposed to set up public organizations to do the things that used to rely on princely patronage. That solves the stabby/poisonous parts of #2 and replaces the embarrassment in #4 with the tedium of filling out forms. (Much better.)

We’ve built things like the National Endowment for the Humanities here in the U.S. of A., but they’re chronically under-funded.  The checkbook is under the control of people who want the agency to be as small as they can make it.  But that point connected me to something else I think a lot about.

Economic interlude

Depressing though it is these days, I stay abreast of macroeconomics.  Since the financial crisis of 2008, the fundamental problem in the developed world has been that all the rich people want to sit on their money. The economy is sluggish because the demand for money is so high that the demand for everything else is too low, so people don’t have work. The interest rates that would re-launch economic growth (according to most models) are less than zero. But that’s not generally possible, so holding cash is better than investing it.

Now here’s where things get weird: in conditions like this, the way to fix the problem of too much demand for money is to print more money. And then you give it away to people. In some people’s imaginations, you just drop cash out of helicopters, but perhaps we can think of something less contusive. With a lot more money sloshing around the economy, that will generate some inflation, which is exactly what we need. The rich people (which includes sovereign wealth funds from oil-producing countries) will see their bank accounts losing value, so they’ll look for things to spend it on before it diminishes. That will re-launch demand, and put everyone back to work.

We’ve even gotten to the point that the old idea of the Universal Basic Income is coming to life again. It differs from helicopter money only because it’s a fiscal program initiated by the legislature, not a monetary program run from the Central Bank.

Resuming

What does this have to do with patronage? (Oh yes, patronage. This post was about patronage, wasn’t it?) Universal Basic Income is probably a bridge too far in our current north-atlantic superculture. I’d propose instead a Universal Research Fellowship program. A grant of (I don’t know; I’ll pull a number from Brenton’s post) $25,000 per annum for anyone who wants to conduct research. Recipients would have to submit a prospectus that’s good enough to pass electronic review (which only checks for plagiarism), and produce an annual paper in the public domain.

Benefits: relaunching the economy; removing a bunch of people from jobs they hate; employing rafts of detectives to track the free-for-all in identity fraud; new golden age of the humanities.

Disadvantages: Brenton’s call to action has the advantage that it’s something that an individual can do, so it’ll probably happen much sooner than my idea. But once the robots take all the manufacturing jobs, collective action will be the only solution big enough to match the problem.

Two-fingered

Several people were live-tweeting the proceedings at the C.S. Lewis & Friends Colloquium, and from them I learned that Sørina Higgins had described Lewis’s poetry as “metrically competent”. Sick burns from faint praise, all in a spirit of good fellowship.  Well, I know a double dactyl when I see one, which led to this.

Higgledy piggledy
Metrically competent
C.S. “Jack” Lewis sat
Down with his pen.

“Time to get on with it,”
Polysyllabically
Counting on fingers, he’s
Writing again

Stealing presentation ideas

Brenton Dickieson recently discovered an unpublished preface to The Screwtape Letters that shows C.S. Lewis originally thought of them as part of the Space Trilogy (and incidentally making him the intellectual forefather of Douglas Adams). He gave a talk about it at the Lewis & Friends Colloquium, and has posted links to a recording of the talk and his presentation materials on his blog. It’s excellent;  30 minutes well spent.

There are three ways that talk is relevant to my project here.

  1. He knocks the “so what” question out of the park.  Easily half the time he’s talking, he’s suggesting ways that the connection can be used to give a new perspective on perennial questions about Lewis’s work. It’s a useful model for how to talk about my “discovery”, though I don’t have such a clear picture of the future.
  2. He gave the talk as a first-person narrative, whence I shall claim legitimacy for my own upcoming presentation. Writing it has been a continual struggle to keep it in the mode of an academic paper.  Releasing myself from that straitjacket will make the rest of the writing much easier.
  3. …and this is the big one, Brenton tossed off a casual side remark to the effect that “world-building is what we’re really interested in, here.” That crystallized for me the structure in which I’m working.  (As I’ve mentioned, I am so slow on the uptake that I make Butterbur look like a quicksilver wit. This is another case of that.) Science can meet Speculative Fiction scholarship to their mutual benefit, if we focus on how the authors build their worlds.  In a way, I’d be reverting to a pre-Enlightenment approach to science, except studying Nature to learn about the intention of the sub-creator.

The last one is really obvious, but there’s a reason the penny hadn’t dropped. This is a way of reading books that I turned away from, decades ago.  Lots of books can’t stand the scrutiny.  Even when I was a child, I had to stop myself from throwing a book across the room when I read a line like, “slow the ship down, we need to be in a lower orbit.” After I’d spent a couple of decades of school, it got worse. I was catching even authors who brag about their mathematical prowess (looking at you, Robert Heinlein) in physics mistakes. If I’d stayed picky, I wouldn’t have had anything to read! The trick to making it work here will be first to discern what parts of the primary world the author was using to convey meaning, and confine my attention there.

Persistence of Family Names

If I still had any doubts about how much we can rely on the persistence of family names, Matt Yglesias just fixed them.  The article reports the results of a study of wealthy people in Florence, Italy in 1427 and 2011.  The richest people today have the same family names as the richest people in the fifteenth century.

The original paper is by Barone & Mocetti of the Bank of Italy.  More information is in a column from the Center for Economics and Policy Research.

Proving a Thesis and its Limits

Prof. Olsen’s Dracula Lecture 8 includes a special bonus rant on the wrong way to write papers about literature. It matches up marvelously with the next section of my paper. The issue, in a nutshell, is that if students think up a thesis and then look for evidence to support it, they can usually find some.  Which is a good first step, but it doesn’t go far enough. Stopping there lets the writer get away with a thesis that’s not necessarily true. Ideally, the writer should also collect all the evidence that the thesis is wrong, and then decide which set is more convincing.

This is one of those cases where being a scientist helps.  Standard methods for data analysis take contrary evidence into account on an equal footing with supporting evidence, so the subject of Prof. Olsen’s rant is one of “the blunders we didn’t quite commit” (in Piet Hein’s words).

Which brings us to the core of the paper:  how do the regions of England that provide the names of hobbits relate to their role in the story?

Hypothesis: Family names from Birmingham or the West Midlands are close to the Narrator; names from other parts of England indicate families to be kept at arms length; and names that aren’t found in England indicate families that are liminal or distant from the Shire.

regions of England

Administrative Regions of England

I’ve previously defined the categories of families. The regions of England are from Wikipedia.  Birmingham, where J.R.R. Tolkien grew up, stretches from the “W” to the “a” in “West Midlands” now; it was much smaller a hundred years ago.

These are administrative regions, but I’ve checked with an English colleague, who confirms that the regions have cultural significance as well as political.  If they were both in London, for example, a person from Warwickshire and a person from Shropshire would agree that they are almost neighbors, as if  they came from the same place.  (An example of the opposite case would be a Virginian and a Marylander. We don’t feel like we’re from the same place, even when we’re both in California.) So it makes sense to include everyone from the West Midlands in a single category, which is essential to this project because the heat-maps are only that precise.

role vs. region.

Hobbit families, by region and role

When we count the number of hobbit families in each group and region, the relationship looks like this figure.  Birmingham names are dominant among the “close” group and rare among the others.  Names from other parts of England are almost as common among the close group, dominate the “arms-length” group, and drop off in the other groups.  Names that do not appear commonly in England are steady across the four groups.  Of the three clauses in the hypothesis, the first seems likely true, but the second and third are dubious.  Not so good.

group vs. region, weighted by importance

Hobbit families by group and region

All names are not equally important, though.  When the importance of each family to the story is included, the graph looks very different.  Important characters with Birmingham names are overwhelmingly close to the narrator.  Other English names dominate the “arms-length” group, as we expect.  The high value of the red line in the “close” group is almost entirely due to Sam Gamgee, as we noted ‘way back at the beginning of this project.  (If Sam were “close”, the red line would drop to 15 at “close” and the purple line would jump up above 35. More on that later.) The big spike of important, non-English names in the “liminal” category is mostly due to Merry Brandybuck.  “Distant” families aren’t important at all.

So, to take us back to the top of this post, the preponderance of the evidence supports the hypothesis. The “Birmingham” line slopes sharply downward, the “Middle-Earth” line of names that sound strange slopes upward, and the “England” line of names that should sound like they’re from far away is in between the two.  The causality runs only one way: if we’d tried to prove that families close to the Narrator were from the West Midlands, the first graph wouldn’t agree.  (Only about half of the “close” families are from there.)  Using a scientific approach tells more than one side of the story, and sets limits on the strength of the conclusion.  With that I shall close, and amuse myself by imagining the look on the face of my high-school English teachers if I’d ever turned in a paper with graphs in it.

Seward’s Folly

Dr. Seward, the narrator of a large part of Dracula, sometimes seems like he’s there to make the reader feel relatively intelligent.  His inability or unwillingness  to comprehend things outside his experienced make him, despite his self-avowed erudition, the last person to understand what’s going on.

Dr. Seward refers to himself as a “sceptic” four times over the course of the novel. Old Pyrrho being unavailable, I’ll step in to say that’s not really what he is. Skeptics don’t believe absolute knowledge is possible, and that includes their own preconceptions. Seward has a solid base of things he knows, and anything contradicting it gets disregarded.  Skeptics doubt their own working assumptions and even the framework in which they reason, the same as new information they receive.  Dr. Seward isn’t doing that at all.  In terms of Bayesian logic, he’s reasserting strong prior probabilities in the face of evidence to the contrary. There’s a word for that: the economist Noah Smith calls it “derp”.

Wait – what’s a “prior probability”? Bayes’s theorem is one of those amazing mathematical results that sits there for centuries before anyone really gets its significance. The basic idea (and you can look to Dr. Smith’s blogpost for a better explanation than mine) is that every thinker has a certain prior base of knowledge that she uses to interpret new information.  As new information comes in, it modifies the odds of each thing in the base, leaving the thinker with a new “prior distribution” of (in this case) the likelihood that each possible cause gives rise to future observed effects.  The mathematical operation that makes that happen is multiplication. One immediate result, therefore, is that if your prior distribution says the likelihood that thing X caused event Y is exactly zero, then the new information gets multiplied by zero.  There’s no amount of new data that can make you think X is really going on.  Dr. Seward has a prior distribution with zeroes assigned to everything he didn’t learn in school.  A skeptic uses a prior distribution with no zeroes in it at all (like a bell curve), because those zeroes are awesomely powerful things, and they’re not to be trusted.

Perhaps I’m being too hard on the good Doctor, but I feel justified because there are examples right next to him of better ways to reason. Characters who use prior probabilities derived from literature seem to work much better. Mina Harker, for example; at times she seems to be the brains of the whole outfit. Why do I say her priors come from literature? Mina may have taken Corey Olsen’s Faërie and Fantasy class (or the 19th-century equivalent).  She knows how to compose an oath so it doesn’t later cause trouble in an entirely-predictable way.  Maybe this is why Prof. Olsen says she’s “awesome”.  Mina reasons from stories. Here’s how she swears never to read her husband’s diary: “I would never open it unless it were for his own dear sake or for the sake of some stern duty.” (Chapter IX)  When I read that, I said, “Brava!” [1]   She drew the crucial lesson from Arthurian romances – be really careful how you swear oaths.  That “unless” clause made the happy ending possible.  (Oops – spoiler!) The sons of Fëanor should have been so wise.

Jonathan Harker has a similar skill at hedging his oaths, though I’m sure his prior probabilities come from law school.  On the expedition to Dracula’s castle, Mina tries to make him understand that her life is secondary in importance to ridding the world of vampires. “’Jonathan, I want you to promise me something on your word of honour. A promise made to me, but made holily in God’s hearing, and not to be broken though I should go down on my knees and implore you with bitter tears. Quick, you must make it to me at once.’
“’Mina,’ I said, ‘a promise like that, I cannot make at once. I may have no right to make it.’” (Chapter XXIV)  You have to love that “may have” — he won’t even commit to that, without consulting his books of precedents.  Any knight of the Round Table would have sworn instantly and suffered for it for the rest of the poem.

The similarity of Mina’s and Jonathan’s thought processes raises a question.  Instead of school, did Mina learn this mode of thought after meeting Jonathan, to be a better wife?  Doing such a thing would be consistent with her character, since it’s not much more difficult than memorizing Transylvanian railroad schedules without speaking Romanian. But I prefer to think that it’s the way she was educated. Victorian girls were taught by literary example (I admit it: my own prior probability distribution is influenced most heavily by Alice’s Adventures in Wonderland). I’m sure that a mode of thinking so consistent with the British legal system was one of the salutary qualities that attracted Jonathan to her in the first place.

[1] Some people write in the margins of books.  I talk to them.[back]

I’m in!

The NY Tolkien Conference 2016 has accepted my paper.  They’ve posted the first iteration of the program, and I’m on it.

They’ve extended the deadline for submissions to June 1st. This has happened with depressing frequency at aviation conferences in the last year – the paper I’m presenting Tuesday at I-CNS is one I submitted when I found out about their extension.

Surname Mapping

An annotated bibliography of surname mapping. Research by James Cheshire and his collaborators underlies this ardagraphic study.  Dr. Cheshire has a blog in addition to his university site linked above.

Oliver O’Brien, Suprageography

O’Brien’s data visualization blog post got this project started.  The public-access web portal provided the qualitative data for classification of family names.  If you don’t have an English name, the latter site hosts a world-wide version (at a much lower resolution).

Cheshire, James A., Paul A. Longley, and Alex D. Singleton. “The surname regions of Great Britain.” Journal of Maps 6.1 (2010): 401-409.

The map of surname regions in Great Britain shows that distributions of names track well with the administrative regions.  The map itself, available for download at the link (17 MB) is gorgeous.

Paul A. Longley, James A. Cheshire, Pablo Mateos, “Creating a regional geography of Britain through the spatial analysis of surnames”, Geoforum, 42, 4, July 2011, Pages 506-516.

Mapping names in the 21st century is valid for this practice because Longley, Cheshire, and Mateos’s techniques make it possible to identify “combinations of location specific surnames that date back 700 or more years”.   Figure 5 shows that the  “Lasker distances” between Census Area Statistics Wards in a region cluster into a tight grouping, and each region is unlike other regions in England.  In fact, some geographic resemblances are visible through the multidimensional clustering:  wards in the West Midlands look like they feel a gravitational pull from Wales, as names originating in the Welsh language diffuse across the English border.

The Lasker Distance is elegantly simple. If we write the fraction of people in a small area i who have the name n as p(i,n), then the distance between areas i and j is -ln(Σ p(i,n)×p(j,n)) where the summation is over all names.  Names that don’t exist in one of the areas don’t contribute to the sum.

Once the distance in “name-space” between population points is established, the next step is to cluster the points in that space, and set the cluster sizes so that the result is interpretable in geographic terms.  The method used here is “k-means” clustering, and I hope I’m not being uncharitable if I describe it as “try every possibility and keep those that work”.  That’s unfair, of course — independent consistency checks are applied at each step; the choice isn’t arbitrary.

Cheshire, James, Pablo Mateos, and Paul A. Longley. “Delineating Europe’s cultural regions: Population structure and surname clustering.” Human Biology 83.5 (2011): 573-598.

Figure 7 in this paper shows the relationship between physical distance and Lasker distance for the countries they studied in Europe.  Culturally homogenous places like Poland and Luxembourg show a tight cluster of points, lying on a line that’s almost horizontal.  That is, you find the same names, no matter where in the country you go.

Scatter plot of Lasker and Geographical distance

Some aspects of cultural history are visible in this figure copied from Cheshire, et al.

Countries unified by language, such as France, Italy, and Germany, show a slanted line (on a log-log plot), with a moderate upward slope.  The further apart two villages are, the more likely you are to find different names in them.  (France has a small Alsatian tail.) Norway and Denmark are fascinating exceptions:  the line slopes downward! I’m just guessing here, but it could be due to the fact that until recently you didn’t get from one place to another by land.  By sea, travel times depend on wind and currents as well, so genetics and patronymics can have a more complicated relationship with distance.  (There might be a follow-on project, there, if I could only find family names in the Sagas.)

Spain has two distinct parts:  One for the mainland and and one for the islands.  They’re identical with respect to names.  The mainland isn’t a long, thin shape, it’s an incoherent blob, caused by mixed Catalan, Spanish, Arabic, and possibly a Basque scattering off to the side.

The United Kingdom is a dense horizontal sprawl of English, with oddly-shaped protuberances of Welsh, Scottish, and Irish that make drawing a best-fit line through the points an exercise in graphical uniformity, not statistical rigor.

they
are always interested in technical
details when the main question is
whether the stuff is
literature or not
archy

Beyond Good and Evil

The criticism of The Lord of the Rings that annoys me the most, and I think I share this opinion with most fans, is when people say the characters are black and white; bad guys are wholly bad and good guys are wholly good, and never the twain shall meet.  These criticisms are made by people who’ve never tried to classify the characters into those groups.

In the post that started this ardagraphic quest, I used the term “bad guys” because I was joking.  Now that I’m seriously trying to make something of that work, I need to replace it with something relevant to the text.  The utility of the good guy/bad guy distinction fell apart for me when I tried to classify Lobelia.  She’s built up as a villain all through the first three chapters, but you have to love an elderly lady, two feet tall, attacking a six-foot oppressor with her umbrella. In any case her repentance at the end, which leads her to give Bag End back to Frodo, ought to disqualify her from the “bad guy” label. “Bad guy” is only useful when talking about Uruk-hai or Bill Ferny.  I hereby abandon it.

A better classification comes from my own experience living in Virginia.[1]  It’s not so much good and bad people, as there are the people you keep close to you, and those about whom you always find yourself saying, “Bless his heart,” (if you’re a woman) or “That’s just Joe” (if you’re a man). They’re not bad guys, per se, but they frequently seem to act in a way that interferes with other folks getting on with their lives. It’s good practice to keep them at arm’s length.

Separation, then, is the classification I’ll use.  The hobbits themselves talk in those terms, and the narrator reinforces it.  I’ll use the terms “close” and “arms-length” to describe the two types of characters above.  Bagginses, Tooks, and Gamgees are “close”; Sandymans and Sackville-Bagginses are “arms-length”.

I see two other kinds of hobbits, besides these Hobbiton types.  First are the fringe elements, who are perceived as being a bit strange and often uncanny. “They still had many peculiar names and strange words not found elsewhere,” the narrator says about the Brandybucks.  (Prologue, i) Likewise the Hornblowers, from ‘way off in the Southfarthing, who “had hardly ever been in Hobbiton before”. (I, i) Since this is a scholarly work, I won’t call them “fringe”.  I’ll say “liminal”.

The last category are the “distant” hobbit-names.  Hobbits in Bree have them.  Frodo uses the name “Underhill” when he’s in Bree because Gandalf (who’s been everywhere) knows that someone who hears it won’t think of the bearer as living anywhere near the Shire.

graph of names and importance

Fig. 1. Classification of Hobbit family names

The counts work out to 13 Close families, 5 Arms-length families, 4 Liminal, and 5 Distant.  These numbers are big enough to be just at the threshold where it doesn’t look silly to put them on a histogram. In Figure 1, the blue bars are the counts of families, and the red line is the sum of the importance of each family in the group.  The distribution of importance is also reasonable; distant characters are less important to a hobbit (and to a story), and the weighting shows that effect.  Families in the Liminal category are slightly more important to the story than those in the Arms-length category because of the presence in the former of Merry Brandybuck.


 

[1] This is not crazy, as the original post hints.  My (English) ancestors, like other long-time Virginians, originated in the West Midlands near Tolkien’s boyhood home, but they left to come here between 1619 and 1750.  They missed out on the birth of the Industrial Revolution in Birmingham, and kept an agrarian lifestyle until recently.  It’s reasonable to conclude that they’re exactly the kind of peasants JRRT had in mind when he imagined the Shire.[back]

Page 29 of 32

Powered by WordPress & Theme by Anders Norén