Thursday 6 December 2018

Economist finds WALS; plays with features; gets publication

Update: Since I wrote this, there's been more discussion on Twitter, with some people thinking that linguists just don't want other academics stepping on their toes. That's definitely not the case! Other suggestions are that linguists have been trained to have a knee-jerk reaction against anything Sapir-Whorfian. That probably is true for a lot of people (including me, actually, which is why I try to be aware of that), but I hope it's clear below that that isn't my problem with this article.
There has also been an open letter started and signed by many linguists asking for the paper to be retracted. I think this is a mistake; there is nothing fraudulent or ethically wrong about the paper. It's just not a very good paper, in a not very good journal. Asking for it to be withdrawn sounds a bit like censorship to me. The review process is where crappy work should be stopped, and that process failed here, so it's worth bringing that to the journal's attention, but seeing as this publicity will have brought them many more readers I doubt they're too bothered about fixing it for the future.

Anyway, here's the post as I wrote it before I thought about all these things.

===
Linguists love it when economists do linguistics. Linguist Twitter was a super fun place to be when this article came out. It argues that languages that can leave out pronoun subjects (like Spanish), are spoken by people that have lower levels of education due to their more collectivist culture. I know right?

No need for me to point out all the ways in which it is wrong and incorrect and foolish; Joe McVeigh spent a happy while with a whisky or two doing that.

The thing is, it's actually not unreasonable to write articles like this. There are a lot of very credible papers that show that our attitudes are easily influenced by factors like what our language encodes. There's ones about noun gender affecting how elegant or sturdy we think bridges are, directionality affecting our ability to orient ourselves in our environment, and so on. And psychological experiments seem to show that it only takes a bit of a reminder that we're female to make us do worse on maths tests, etc. Some of these studies maybe are not as robust as they look, and I don't know about the reliability of the psychology ones, but the point is they are by linguists and psychologists and they look credible. So why wouldn't you write an article showing how some facet of language influences your behaviour?

Well, perhaps if you're not only not a linguist, but you also don't know anything about language and don't ask anyone who does, and you don't do it very well.

I don't know the economics dataset that the author uses, but I do know the linguistics dataset very well. It's the World Atlas of Language Structures, which I love very much. This author, Feldmann, uses it because it "provides the most authoritative information on a large number of languages". It does indeed cover a large number of languages, but there is no reason to say it is "the most authoritative". It's compiled from published grammars. Many of those are careful, detailed, accurate descriptions of the language; others are a hundred years old, written by someone who didn't necessarily have much linguistic training. You have to be careful and check those sources out. His only reason for saying it's "authoritative" is that an economics reference says it is, using that same word, and then he cites them with a glaring error in a Spanish example ("yo ablo").

Another thing is that it doesn't control for languages being closely related unless you ask it to, and to do that you need the CD-ROM version, not the online free version, and there's not indication of the method the author used so we don't know if he did that. He just says that he looked at 103 languages. 711 have this information in the free version; I can no longer use my CD-ROM copy as I don't have a CD drive in my computer any more :( so I don't know what subset he took. For example, if you take all the languages spoken in Northern Europe, it's not so surprising if most of them require pronominal subjects, because they're all related. It's better to take a genus of language to avoid skewing your results. Maybe he did this; we don't know.

His citation is poor; his linguistic sources are old or eccentric or missing or simply odd choices. They look like the citations of someone who hasn't read the linguistics literature or asked anyone who has. He doesn't give any sources at all for his claims about collectivist cultures not wanting girls to be educated, which is a big claim and one that really needs backing up.

Go ahead; make claims about culture based on linguistics. They don't tend to stand up to much scrutiny, but maybe yours will. But don't exoticise those people because of it, and don't base those claims on superficial data with no referencing or linguistic research.

No comments:

Post a Comment