Showing posts with label languages. Show all posts
Showing posts with label languages. Show all posts

Monday, 11 April 2022

At last, someone has written about Wordle!

I've held off on blogging about Wordle, because everyone else did it, and because I didn't have anything particular to say. People tend to assume that if you're a linguist, you like word games, but I don't think that's any more true for us than for normal people. Some of us do, others don't. I happen to love crosswords (because there is a quiz or a puzzle element) and dislike Scrabble (because I'm not good at anagrams). I do, as it happens, love Wordle. I love logic puzzles like sudoku, and this is basically just a logic puzzle with an added constraint. 

There is, or used to be, a board game called Mastermind which was a pure logic version of Wordle. (If you don't know what Wordle is, by the way, I don't know where you've been. It's what we all spent the early part of 2022 doing.) There, the thing you had to guess was the sequence of coloured pegs. There were only a few colours, and only a sequence of four, so much fewer than the 26 letters and five slots that Wordle involves. And you needed like ten goes to get it, rather than the six that you get with Wordle. The rules were the same: you got told if you'd got one right and in the right place, or right but in the wrong place, or wrong. You weren't told which one, though, which did make it harder in that respect (otherwise it would have been incredibly easy). I loved this game and I'm not sure why I never had my own copy (maybe no one else liked playing it with me, or maybe I never mentioned that I liked it?) but I played it when I was at other people's houses. 

So yes, I do love Wordle, because of the logic puzzle aspect. The word part of it does add something interesting for me, though. I like the constraint it puts on the possible answers. It's not the case, as in Mastermind, that every combination is equally possible. Some just aren't, or are much less likely, and that's due to the rules of either languages in general, or English in particular. So an example of a language-in-general thing is that there are going to be some vowels in the word, and some consonants. An example of an English-in-particular thing is that the last letter is probably an 'e' or a consonant, because we don't have so many words that end in 'a', 'i', 'o' or 'u' (though we do have some, so it's not absolutely ruled out). Another English-in-particular thing is that if you know you've got an 'h' in there somewhere, it's possibly the first letter but if it's not, you've likely also got a 't' for 'th' or a 'g' for 'gh' in there. Not always; ahead would have stumped me in that case. 

Screenshot of my Wordle stats showing a normal distribution with most words taking me four guesses to get.

I've been paying attention to how I solve them, and I usually get the answer on the fourth go. I imagine this is true for most people, as we'd expect a 'normal distribution' with very few right on the first or second go (that's a lucky guess) and few taking six (that's some bad luck or a word that has many very similar to it). 

I'm not sharing any new insights on how to solve them – I just do the same as you all do and rule out the most common letters first until I can see what it's likely to be. But what interests me is how quickly you get to the point where it can only realistically be one word. This is normally where I am by guess four. 

Here are a couple of recent ones, where the answers were epoxy and lowly. Just coincidence that they both end in a 'y', I think. I vary my starting words but always try to include some common letters. Sometimes I just use things I see nearby like the dogs' names. In both these cases, by the time I'd had three guesses I didn't have many right, but I had ruled out nearly all the possibilities, and there was only one possible word that I could think of in each case that could fit what I knew. 


Screenshot of Wordle with the word 'lowly', correct on the fourth go with few correct letters on the previous three. The previous image shows the same but for 'epoxy', but I can't edit the alt text for some reason.

This is the most satisfying way of playing the game, I think. If you end up with only one letter to get and several possibilities, it becomes chance and annoying, and if you get it right with some lucky guesses you don't feel like you earnt it, whereas this way you feel happy that you worked it out. 

I also saw Lesley Jeffries talking about doing it in other languages, and noting that her guess distribution was much more spread, presumably because her vocabulary is not a large in those languages and so she is likely to need more goes to get it right than the average speaker of that language would (and she noted that she is relying on phonotactics, which is those rules of the language that I mentioned earlier). 

Wednesday, 6 January 2016

Ne translatez pas les languages

A friend posted this photo on facebook the other day:



A number of highly intelligent people then missed the point of the joke and began to comment on how bad the translations are. I think they're pretty good imitations of the languages in question, with one big error: the 'German' looks (unmistakably) more like Dutch in the second half of the sentence.

These are not intended to be translations, of course, or rather they're deliberately not accurate. They're just meant to amuse the English-speaking audience by including funny words to compound the humour of the warning in English ('avoid pouring on crotch area'). After all, we love nothing more than when a foreign language does something in a funny way (cf. Welsh popty-ping for 'microwave' or German Handy for 'mobile phone') so it's nice to imagine that these might be for real.

Let's begin with the French. The grammar is fine, as far as I know: you'd make an imperative in French with the negative and the 2nd person plural inflection, just as it's done there. The phrase dans l'area seems OK to me, too. The vocabulary used just isn't French, that's all. The verb for 'pour' should, I think (Google Translate helped) be verser so you'd have ne versez pas. I don't know if you'd also need a pronoun in there (don't pour it) or not. And, of course, no French person ever says ooh-la-la, but it's stereotypically French and referring to ones' crotch that way goes nicely with the French reputation for romance.

Next, the 'German'. This one caused a bit more controversy in the comment thread, because it very obviously isn't German. My German is less good than my French, but I'm pretty certain the word order here is wrong as well as the vocabulary and morphology. I think you would say literally 'drop you not...' rather than 'not drop...', which is what we appear to have here: nein is German for 'no'. Then the verb is obviously just English again, with droppen instead of pourez. I think they've simply selected words that have a combination of letters that resemble the language in question (so French has a word pour, for instance, but it doesn't look very Germanic).

Next, though, I think we've got a nice case of representing an accent in words that look like (or indeed exist in) the language. So ze haut kaffe does not mean 'the hot coffee' in German (haut apparently means 'skin', for instance, and the German for 'coffee' is in fact kaffee, which they could have used instead), but it looks German-ish and sounds like someone saying the hot coffee in (some kind of representation of) a German accent. It reminds me of this a bit. Notice also that the 'German' has an overt object with article, while the French has nothing at all ('don't drop _ on the crotch area' vs 'don't drop the hot coffee on the crotch area').

Then it most definitely switches to Dutch-looking words, if we hadn't already. Dutch word order is a little bit more familiar to English speakers as well, I think, although I know even less Dutch than I do German. Anyway, here we've got a dead giveaway for Dutch: that word oont. It just has a Dutch-like feel to it, though I don't know why (double 'o'? final 't'?). And then finally the lovely phrase, ze knakkers. Again, this is definitely Germanic and in fact Google Translate does give 'knackers' as the translation of this as a German word (it gives 'frankfurter cherry' if you tell it knakkers is Dutch!).

Good work, coffee-cup-humour-producing-person!

Thursday, 13 March 2014

Code-switching contrastive emphasis

I'm watching Salamander. It's a kind of political thriller, lots of running about, photocopying and suchlike. It's also Belgian, and the dialogue is mostly in Flemish. It's a bit complicated to explain the situation, because 'Flemish' is Dutch as spoken in Flanders, but then some varieties (eg West Flemish) can be considered separate languages, and Flemish is a cultural label as well, and... well. Anyway. They're speaking Flemish for most of the time. Every now and then, though, they switch into French, either for a single word, a few words, or some of the characters seem to use French as their preferred language so you get a whole conversation in it. You also sometimes get 'parallel talk', where one person in a dialogue uses Flemish and another uses French.

One such scene took place, in which a dialogue took place between about five people, one of whom was using French. He said 'We have to know who our enemy is', and a Flemish speaker replied 'enemIES', stressing the fact that there are more than one single enemy. This is contrastive stress, and is a nifty way of indicating that you're contrasting something you've said with something the other person either said or implied. The contrast can be lexical/semantic: I want BEANS (not cheese), or it can be grammatical, as you can see here, where singular is contrasted with plural. Normally you need to contrast two things that are similar in some way, like two nouns/foodstuffs/potato fillings.

What's special about this is that the two languages do plural in different ways. French adds an -s in the written language, going from ennemi to ennemis. But in fact, in speech, you won't usually hear that -s, and the only thing to tell you the difference is the 'determiner', an article, demonstrative, etc. In this case it was 'our', which in French is notre for singular and nos for plural. I didn't catch the exact word in Flemish but in Dutch it's vijand for 'enemy' and the plural is marked with an -en: vijanden. The possessive determiner stays the same, and in any case the Flemish speaker didn't repeat 'our'. So we had this:
A: (We have to know who is) notre ennemi. 
B: vijandEN.
The Flemish speaker contrasted a possessive determiner with a totally different morphosyntactic category, a plural inflection, because he was contrasting the feature [number], which is encoded on the determiner in French and the noun suffix in Flemish.

Wednesday, 15 May 2013

In which continent is Honolulu?

In linguistics papers, we often need to give examples of some linguistic construction or other, and so we use example sentences. It didn't strike me as at all odd until I talked to other people about linguistics, and they found it hilarious. Still. It's what we do. What I mean by this is that, say I'm writing about questions, and I need to illustrate what questions look like in various languages, I'll just give an example of a question. It doesn't matter what question, usually; it just has to be a normal question with nothing syntactically weird going on. So I might give these two examples:

Cantonese:
neih johng hohk sahn ma
‘Have you  rushed into the crane god, eh?’ (Morrison 1828, cited in Cheung 2001: 223)

Mizo:
vok i2-­‐n vulÊ” doon1 em2
‘Are you going to raise pigs?’ (Chhangte 1989: 162)
I can generally be relied on to pick the silliest examples I can find, so my thesis contained lots of examples involving trousers being covered in bitumen, and expressions with rude words in them. You don't always have a lot of choice, if you don't speak the language, but if you're working on a language you speak, you can make up good examples. I despair of the number of papers I read in which John lends Mary yet another book. In my constructed examples, all sorts of things happen: gorillas get seasick, Poirot solves murders, Richard Burton is a post-mortem actor, and we have to choose whether we would rather be able to fly or be invisible. (You have to make sure you don't go too far, though. The Generative Semantics movement of the 1970s got a bit silly in this respect.)

Reading examples in other languages can be informative: I learn the words for all sorts of things in other languages, for instance. Did you know that the Cantonese for 'Christmas' is literally 'Jesus birthday'? And the Swahili for 'lion' is 'simba'?

But today I was reading a paper and was baffled. It's Hamblin's classic 1958 paper on questions. In it, he discusses how some questions include a presupposition. The (in)famous example of this is 'Have you stopped beating your wife?', which the defendant cannot answer 'yes' or 'no', because either answer presupposes that he has, in the past, beaten his wife. Hamblin notes that a question like 'In which continent is Luxembourg?' also contains a presupposition (that Luxembourg is in a continent), but that it is unimportant because the presupposition is true. He gives this question as the equivalent example where the same presupposition would be untrue: 'In which continent is Honolulu?'. You cannot, says Hamblin, answer by simply saying what continent Honolulu is in, because it is not in a continent at all.

Clearly, this is now false because Honolulu is in Hawai'i, and Hawai'i is in the United States, and the US is in America (or North America - do we class North and South as separate continents these days or not?). In 1958, when this paper was written, Hawai'i was not yet a State: it became one in 1959. But it was a Territory, and so, I thought to myself, surely still considered to be in the continent of (North) America?

Anyone any idea?