Category Archives: Translations

Arabic, French and English Versions of ISIS’ Claim of Responsibility for the Paris Terror Attacks

The initial statement was released in French and Arabic:

Here is the Arabic version first:

Original Arabic version.

Original Arabic version.

The following is the French version:

French version.

French version.

It’s not perfect, but this is the best English translation I could come up with.

In the name of Allah the merciful, the very merciful Allah:

Allah the transcendent has said: And they thought their fortresses would truly shelter them against Allah, but Allah came to them from where they didn’t expect and put terror in their hearts. He demolished their houses by their own hands as well as those of the believers. Learn this lesson, ye who is blessed with foresight. Surat fifty nine second verse

In a holy attack made possible through Allah, a group of believers and soldiers of the Caliphate, from the Caliphate – blessed with power and triumph be it through Allah – targeted the capital of abominations and perversion, the one which bears the banner of the cross in Europe: Paris.

A group which tore asunder its earthly ties chased the foe, searching for death on the path of Allah for the sake of His faith, His prophets and His allies, and the willing humiliation His enemies. They have been true to Allah, and true we consider them. Allah has conquered by their hand, and instigated fears in the hearts of the Crusaders in their own land.

Eight brothers wearing explosive belts and bearing assault rifles attacked precisely chosen determined places in the heart of the French capital.

The targets were the Stade de France during a match between opposing Crusader countries, France and Germany, which was attended by the fool of France, François Hollande; the Bataclan, where hundreds of heathens were gathered for a most perverse party; and many in the 10th, 11th and 12th arondissements simultaneously. Paris has trembled under their feet, and the streets tightened in their wakes. The death toll is at least two hundred Crusaders with many more wounded, glory and praise be to Allah.

Allah made it easy for our brothers by allowing them martyrdom, so their explosive belts went off on the heathens when the ammunition ran out. May Allah accept them among the martyrs and allow us to join them.

France and those who tread its path must know that they remain the main targets of the Islamic State and that they will continue to smell the stench of death for having led the Crusade, insulted our Prophet (PBUH), and boasted about fighting Islam in France and striking the Muslims in the land of the Caliphate with their planes which were of no help in the reeking streets of Paris. This attack is only the beginning of the storm and a warning to those who heed the lesson to be learned.

Allah is the greatest. And power be to Allah and to his messenger as well as believers. But the hypocrites may never know. Surat 63 verse 8.


Filed under Arabic, Europe, France, French, Islam, Radical Islam, Regional, Religion, Terrorism, Translations

Comparison of Inflected Verb Forms in English, Swedish, German and Finnish

Below is an Internet joke about the Finnish language. It shows how Swedish and German are both more complicated than English and in addition, how German is more complicate than Swedish. And of course, Finnish is wildly more complex than them all. You would think that Finnish dictionaries must be Hell, but that’s not the case. Generally only the root is listed, and the inflections are not. It is the same in English dictionaries where only run is listed and runs, ran, and running – the inflections, are not.

Of course, all of the forms below are not separate words for dog. Instead they mean things that would be expressed by a phrase in English such as with a dog, to a dog, from a dog, of a dog, for a dog, in a dog, dog’s. After that, there are the same forms with possessive suffixes such as with my dog, to your dog, from his dog, of their dog, for our dog, in her dog, its dog’s. And finally there are forms that attach to the possessive case forms such as My dog?, Even with your dog?, and Even without our dog.

“English: A dog.
Swedish: What?
English: The dog.
English: Two dogs.
Swedish: Okay. We have: En hund, hunden, Två hundar, hundarna.
German: Wait, I wan’t to try it too!
English: No, go away.
Swedish: No one invited you.
German: Der Hund.
English: I said go away.
German: Ein Hund, zwei Hunde.
Swedish: Stop it!
German: Den Hund, einen Hund, dem Hund, einem Hund, des Hundes, eines Hundes, den Hunden, der Hunden.
Finnish: Sup.
English: NO.
Swedish: NO.
German: NO. Finn, you go away!!
Finnish: Koira, koiran, koiraa, koiran again, koirassa, koirasta, koiraan, koiralla, koiralta, koiralle, koirana, koiraksi, koiratta, koirineen, koirin.
German: WHAT?
Swedish: You must be kidding us!
English: This must be a joke
Finnish: Aaaand… koirasi, koirani, koiransa, koiramme, koiranne, koiraani, koiraasi, koiraansa, koiraamme, koiraanne, koirassani, koirassasi, koirassansa, koirassamme, koirassanne, koirastani, koirastasi, koirastansa, koirastamme, koirastanne, koirallani, koirallasi, koirallansa, koirallamme, koirallanne, koiranani, koiranasi, koiranansa, koiranamme, koirananne, koirakseni, koiraksesi, koiraksensa, koiraksemme, koiraksenne, koirattani, koirattasi, koirattansa, koirattamme, koirattanne, koirineni, koirinesi, koirinensa, koirinemme, koirinenne.
English: Those are words for a dog???
Finnish: Wait! I didn’t stop yet. There is still: koirakaan, koirankaan, koiraakaan, koirassakaan, koirastakaan, koiraankaan, koirallakaan, koiraltakaan, koirallekaan, koiranakaan, koiraksikaan, koirattakaan, koirineenkaan, koirinkaan, koirako, koiranko, koiraako, koirassako, koirastako, koiraanko, koirallako, koiraltako, koiralleko, koiranako, koiraksiko, koirattako, koirineenko, koirinko, koirasikaan, koiranikaan, koiransakaan, koirammekaan, koirannekaan, koiraanikaan, koiraasikaan, koiraansakaan, koiraammekaan, koiraannekaan, koirassanikaan, koirassasikaan, koirassansakaan, koirassammekaan, koirassannekaan, koirastanikaan, koirastasikaan, koirastansakaan, koirastammekaan, koirastannekaan, koirallanikaan, koirallasikaan, koirallansakaan, koirallammekaan, koirallannekaan, koirananikaan, koiranasikaan, koiranansakaan, koiranammekaan, koiranannekaan, koiraksenikaan, koiraksesikaan, koiraksensakaan, koiraksemmekaan, koiraksennekaan, koirattanikaan, koirattasikaan, koirattansakaan, koirattammekaan, koirattannekaan, koirinenikaan, koirinesikaan, koirinensakaan, koirinemmekaan, koirinennekaan, koirasiko, koiraniko, koiransako, koirammeko, koiranneko, koiraaniko, koiraasiko, koiraansako, koiraammeko, koiraanneko, koirassaniko, koirassasiko, koirassansako, koirassammeko, koirassanneko, koirastaniko, koirastasiko, koirastansako, koirastammeko, koirastanneko, koirallaniko, koirallasiko, koirallansako, koirallammeko, koirallanneko, koirananiko, koiranasiko, koiranansako, koiranammeko, koirananneko, koirakseniko, koiraksesiko, koiraksensako, koiraksemmeko, koiraksenneko, koirattaniko, koirattasiko, koirattansako, koirattammeko, koirattanneko, koirineniko, koirinesiko, koirinensako, koirinemmeko, koirinenneko, koirasikaanko, koiranikaanko, koiransakaanko, koirammekaanko, koirannekaanko, koiraanikaanko, koiraasikaanko, koiraansakaanko, koiraammekaanko, koiraannekaanko, koirassanikaanko, koirassasikaanko, koirassansakaanko, koirassammekaanko, koirassannekaanko, koirastanikaanko, koirastasikaanko, koirastansakaanko, koirastammekaanko, koirastannekaanko, koirallanikaanko, koirallasikaanko, koirallansakaanko, koirallammekaanko, koirallannekaanko, koirananikaanko, koiranasikaanko, koiranansakaanko, koiranammekaanko, koiranannekaanko, koiraksenikaanko, koiraksesikaanko, koiraksensakaanko, koiraksemmekaanko, koiraksennekaanko, koirattanikaanko, koirattasikaanko, koirattansakaanko, koirattammekaanko, koirattannekaanko, koirinenikaanko, koirinesikaanko, koirinensakaanko, koirinemmekaanko, koirinennekaanko, koirasikokaan, koiranikokaan, koiransakokaan, koirammekokaan, koirannekokaan, koiraanikokaan, koiraasikokaan, koiraansakokaan, koiraammekokaan, koiraannekokaan, koirassanikokaan, koirassasikokaan, koirassansakokaan, koirassammekokaan, koirassannekokaan, koirastanikokaan, koirastasikokaan, koirastansakokaan, koirastammekokaan, koirastannekokaan, koirallanikokaan, koirallasikokaan, koirallansakokaan, koirallammekokaan, koirallannekokaan, koirananikokaan, koiranasikokaan, koiranansakokaan, koiranammekokaan, koiranannekokaan, koiraksenikokaan, koiraksesikokaan, koiraksensakokaan, koiraksemmekokaan, koiraksennekokaan, koirattanikokaan, koirattasikokaan, koirattansakokaan, koirattammekokaan, koirattannekokaan, koirinenikokaan, koirinesikokaan, koirinensakokaan, koirinemmekokaan, koirinennekokaan.
Swedish: Breath!!
German: Whattaaa?
English: Okay, now you’re just making things up!
Finnish: And now the plural forms…..”


Filed under Balto-Slavic-Germanic, English language, Finnic, Finnish, Finno-Ugric Languages, German, Germanic, Humor, Indo-European, Indo-Hittite, Language Families, Language Learning, Linguistics, Swedish

Croatia, 1846

There probably wasn’t really any such thing as “Croatia” back then, but anyway, let us discuss what was happening in the territory we currently refer to as the nation of Croatia.

  • What was the official language (Slavic)?
  • What were the two other languages that were widely spoken everyone or nearly everyone along with the official one (both Slavic)?
  • What was the language most commonly spoken by educated people, especially in cities? For instance, if you went into a bookstore in Zagreb, the books would mostly be in this language (non-Slavic)?
  • What was the language of science and the ultra-elites? As an example of how this language was used, what was the official language for the Croatian Parliament? (non-Slavic)?
  • What was the official religion?

Five questions, five whole questions, now hard could it be?

Have fun kids!


Filed under Balto-Slavic, Balto-Slavic-Germanic, Europe, European, German, History, Indo-European, Language Families, Linguistics, Modern, Regional, Religion, Serbo-Croatian, Slavic, Sociolinguistics

Beherrschen Linguisten viele Sprachen?

This is a German translation of the post, Linguists Know Lots of Languages? that appeared first on the old site. I used to have a lot of folks translating articles for me on the old site because I had so much traffic coming in and I wanted to accomodate international readers. I would keep track of how many would come from any country for any post and then tally them up. At some point, I would have enough demand for a transation. The blog was making no money at all, so I was volunteering, so I asked all of my translators to volunteer also.

This post might be interesting to any of you who know German. If you want, I can put the English version of the post in too.

Beherrschen Linguisten viele Sprachen?

Ein weit verbreitetes Mißverständnis ist, dass Linguisten viele Sprachen beherrschen. Eine Abwandlung davon ist, dass wer nicht polyglott ist, auch nicht für einen Linguitik-Studiengang zugelassen wird – und schon gar nicht, wer nur eine Sprache spricht.

Viele ältere Leute denken, das Wort “Linguist” sei ein Synonym für “polyglott”.

Ich habe einen Master in Linguistik und spreche nur eine Sprache gut: Das ist Englisch. Mit Spanisch komme ich einigermaßen zurecht, aber ich beherrsche es nicht fließend und schon gar nicht wie ein Muttersprachler. Ich verstehe ein bisschen Italienisch, Französisch, Portugieseich und Chukchansi Yokuts (eine Sprache kalifornischer Indianer), aber mein Spanisch ist besser, als diese Sprachen.

Als Linguist muss man nicht mehr als eine Sprache beherrschen. Beispielsweise habe ich etwa die Hälfte eines Wörterbuches und Sprachführers in Chukchansi Yokuts fertig gestellt, aber eher würde die Hölle vereisen als dass ich diese Sprache wirklich zu beherrschen lernte. Ich habe nur die Daten gesammelt, organisiert, analysiert und in eine Lexikon und etwas Lehrmaterial umgearbeitet.

Für meinen Linguistik-Studiengang war es nicht einmal Voraussetzung, zweisprachig zu sein, um zugelassen zu werden. Ihn haben Viele studiert, die nur eine Sprache beherrschten. Sicherlich, es gab auch viele ausländische Studenten, die jedoch alle auf einen ESL-Abschluß hinarbeiteten (ESL = English as a second language) und dann wieder im Ausland Englisch als Zweitsprache unterrichten wollten.

Alles was wir machen, ist das Studium von Sprachen. Aber man muss die Sprachen nicht wirklich erlernen um sie studieren zu können. Aus irgend einem Grund verstehen viele Leute das nicht.

Es ist wirklich wahr [in diesem Sinn], dass viele Linguisten mehr als eine Sprache kennen, lesen können und schreiben können.

Ein Linguisten-Witz (Mal seh’n, ob Sie ihn verstehen. Sie müssen viellecht ein bisschen nachdenken.): Man sagt, der berühmte Linguist Roman Jacobsen spräche Russisch in 17 verschiedenen Sprachen.

Leave a comment

Filed under Applied, German, Linguistics, Multilingualism, Translations

Many New Sick and Evil Videos and Photos at the Video Site

One cheer for sick! Two cheers for evil! Three cheers for sick and evil! Hip hip hooray! Hip hip hooray! Hip hip hooray! Sick sick and evil! Sick sick and evil! Sick sick and evil!

Yay! Gather round, boys and girls from 8 to 88, we have have lots of great new twisted videos and pics available on the video site.

Man Swallows His Own Face: The first is a photo of a man in China who appears to be swallowing his own face! This is known as “girning.” I’m not quite sure why some folks can girn and others cannot, but it sure is weird.

Leanne Holland Death Photos: These are photos that were obtained by a website called Viral Death showing the crime scene photos of a young girl named Leanne Holland who was brutally raped and murdered in Australia. A man who was staying with the family was arrested and convicted of the crime, served a number of years in prison, but was then released due to lack of evidence. It appears that he is innocent after all, and the real killer was a known predator who lived a few miles away. The photos don’t show nudity, so there is no child porn. Their release caused quite an outrage in Australia.

Kurdish Teenager Stoned to Death in Iraq: The famous case of the Yezidi girl Doa Khalil Aswad, stoned to death in a crowd of 10,000 men in a village near Mosul. This post has already been published to this site.

Car 1, Motorcycle 0: Shows the aftermath of a car-motorcycle crash somewhere in China, followed by photos of the victim, apparently a prominent politician, meeting with his constituents. By the way, the car won.

Black Man with Split Face: This poor Black guy has had his face nearly split in two but is somehow still alive. Photo. Some say this is fake, but it appears to be genuine.

The Man With No Face: A very strange set of photos of a man who suffered a horrible injury that resulted in the loss of most of his face, including his eyes and nose. As you don’t need these organs, he’s still alive. The surgeon then amazingly constructs an artificial face for the guy with fake eyes and nose and somehow sticks it onto his real face with metal pinnings. Incredible set of heartbreaking photos, but I don’t know the details of the case.

Drunk Girl Knocked Out by Dive: A drunken girl dives off a high cliff into the Colorado River and is knocked unconscious due to the height of the dive. Guys jump in and rescue her.

Dive Gone Wrong in Oregon: Similar video, in this one, another young girl dives off a 90 foot cliff into Lake Oswego in Oregon, which was a totally insane thing to do. She too is knocked out and rescued by guys. It’s amazing she did not die. She was in bad shape for two days but came out of it ok.

Three Men Electrocuted in Iran: This video is amazing, Good Samaritanism gone wrong. An idiot is stuck way up on a high tension pole and is getting shocked. A large crowd has gathered and is screaming. One guy climbs up to rescue him, but he can’t get him down. He starts getting shocked too. Towards the end, a third guy climbs up the pole very fast and grabs the other two guys. Quickly the whole pole blows up in a huge explosion as the guys go flying and the crowd goes nuts. Details not known.

Tire Blows Up under Russian Man: A very strange video. Three young Russian men are drunk in a tire shop. Two guys put an air bag under a tire, cover it with a towel and then encourage a third guy to sit on the tire. The victim apparently knows what is going on and does so willingly. After a short wait, the air bag blows and the guy goes flying in the air. Lots of drunken laughs all around, but this guy could have gotten seriously hurt in this incident.

Chicken with Its Head Cut Off: Laughing rednecks grab a chicken and chop off its head. The headless body runs around a bit, and the other chickens freak out and attack the ghoul chicken. The guys laugh their heads off.

Lunatic Dives into a Burning Car: This is one of the creepier videos I have seen. Guys are filming a burning car by the side of the road somewhere in the US, possibly in the forested northern US. A crowd has gathered to watch the car burn. A fire engine pulls up. Then a guy in his late 20’s suddenly breaks from the crowd towards the car and dives into the burning vehicle. It’s hard to see, but you can see it if you slow the video down and pause it. He has a weird and disturbing expression on his face.

Gang of Teens Attack Elderly Man in Cleveland: A gang of young Black teens is in front of the home of a man in his 70’s. The man and his friend, a man in his 50’s, confront the teens and tell them to get out. Then the two men, both White, drive to a nearby liquor store, apparently to call cops. The gang follows them there and beats both men. The man in his 70’s was beaten badly. The teens run away.

Man Stabbed to Death in New York: It is 3 AM on a street in a bad part of New York City. Video is taken by a security camera. A young woman walks by, and a thug jumps out and tries to rob her. A man runs up to the thug to try to prevent the crime and the thug stabs him with a knife. The guy staggers away and falls down on the sidewalk. For about an hour afterward, people come and go and no one tries to help him. Firemen come after a bit, but he’s already dead.

Crazy Russian Jumps from Building to Building: Amazing video of a young Russian guy engaging in parkour jumping in Russia, where they jump from the roof of one apartment complex to another roof below. Jump may well be 50-75 feet, broken only by a mat below. Very risky sport.

Insane Bungee Jumping off a Waterfall: Young people line up to bungee jump off a waterfall. It’s a long ways down! Footage possibly shot in India. Amazing video.

Idiot Jumps Off Building and Walks Away: A very mysterious video taken off a Danish video site. A young man or teenager is on the roof of a tall building, several stories high. He leaps off the building and lands flat on the ground below, stomach first. There is blood coming from his head. He picks himself up and there is blood dripping off his head. He walks towards the camera and the video ends. I don’t see how this guy survived this fall to walk away, but the production is way too cheap for it to be faked. This video doesn’t really make sense.

Man Commits Suicide at Hoover Dam: Amazing video. Guys are videotaping the face of Hoover Dam and suddenly an idiot jumps off the edge and tumbles down the front, killing himself. The top of his skull appears to come off as he tumbles down the concrete.

Cow Loses Face in Train Crash: There has been a train crash in India, and a cow has been seriously hurt. Most of its face has been torn off, but somehow it is still alive and standing there like nothing is wrong. Scene shifts to the side of the train where people are laying flowers by wreaths.

Girl Throws Live Puppies into River: This is the famous video that was discussed quite a bit on this blog. Video recently went wildly viral. A wild dog in Bosnia gave birth to puppies in the backyard of a home. The dog was very sick. Grandmother found the puppies and feared they were as sick as the mother. She put the puppies in a bucket and gave them to a teenage girl, telling her to throw the puppies in the river and kill them. Video shows girl doing just that. The video resulted in a hunt for the girl. Bosnian authorities tracked her down, but did nothing to her due to her age.

Man Commits Suicide in a Police Station: Famous video of an illegal alien who shot a cop and was then arrested in San Bernardino, California. For some reason, he was never searched. Video shows the man in the interrogation room. Detective leaves to go get some coffee. Criminal reaches into his pocket, pulls out a huge gun and shoots himself in the head, killing himself.

Chinese Fishing Boat Collides with a Japanese Coast Guard Vessel: This was previously published on this site.

Tarzan Swings into a River and Crashes: Two young American guys have fashioned some sort of a rope from a bank out to a river. The distance from launch site to river is about 50 feet or so. One guy goes to the launch site and launches towards the river, then crashes hard onto the bank instead. Looks like he got hurt.

Horrible Injuries and Infections: Series of photos of some very nasty injuries and infections. Everyone survived and is doing ok though.

Brazilian Man with Infected Shoulder from Shooting Steroids: A gruesome operating room video of a Brazilian guy with a massively swollen shoulder from an infection acquired by shooting steroids with a dirty needle. They endless drain the horrible pus from the wound. Really gross.

Farmer Hay Bales Himself: Crazy video. Farmer tears off his clothes and jumps in his hay baler. His friend turns it on. The guy gets packed into a bale of hay and ends up being shot out the chute at the end. He stands up and its a human hay bale with arms and legs sticking out. Really ridiculous.

Bum Eats Rats: Weird video of a bum in Brazil. He reaches down, picks up either rats or mice one by one, puts them in his mouth and munches them down while smiling. He pauses a couple of times to drink some water.

Man with a Shovel Stuck in His Head: Crazy video shot in Brazil. A gang of young men in a favela were roaming around robbing people in their houses. At some point, a posse of angry residents caught up with them and attacked them. One guy was shot several times and someone took a shovel and hit him in the head with it so hard that it stuck in his head. Video shows the crook writhing on the ground, shot by bullets with a shovel stuck in his head. Disturbing.

Guy Has Tongue Splitting Operation: This shows the operation to give you one of those body mod forked tongue things we showed in a gif a while back. File under Don’t try this at home. The incision is made and his mouth is full of blood. Afterward, shows the guy smiling with his new weirdo forked tongue. Really gross video with tons of blood.

Budd Dwyer Suicide Video: The very famous live recorded suicide of Budd Dwyer, former Treasurer of Pennsylvania, recorded before a crowd of reporters taping the whole thing. Detailed here.

Serbian Dogcatcher Chokes a Dog: Previously described on this site.

Foolish Dancer Creamed by Ice Cream Truck: Previously posted on this site.

Weightlifter Tries to Lift Half-ton Barbell, Nearly Dies: Previously posted on this site.

Skateboarder Has Serious Accident and Hurts His Balls: A young boy, maybe around 13 or so, is skateboarding with some other boys. He goes off a staircase with 11 stairs on it and crashes at the bottom. He jumps up and starts screaming that he’s hurt his balls.

Eat My Engine Block!: Previously posted on this site.

Al Jazeera Releases Photos of Sri Lankan Genocide: Previously described on this site.

Bullfighter Gored in Throat by a Bull: Previously posted on this site.

Swedish Man Commits Suicide Live on the Internet: In October 2010, a young Swedish man with Aspergers killed himself live on the Internet while a large group of people watched helpless to do anything about it. Video shows time lapse photography of the man setting up a rope to the ceiling, then fastening his neck to it until he slowly hangs. At the end, the firemen bust down the door and try to rescue him, but he’s gone. Really disturbing.

Black Female Fried Chicken Eating and 40 Oz Drinking Contest: Previously posted on this site.

Ukraine Air Show Disaster 2002: A truly disturbing video of a terrible accident at an air show in the Ukraine in 2002 that killed 92 people when the plane crashed into the crowd. You see the plane doing its maneuvers, then the camera goes nuts as the plane crashes to the ground. The cameraman runs to the scene of the crash, filming all the way. He comes upon injured people, dead people, parts of dead bodies, people missing limbs, people running everywhere to get away or try to help and cops and military personnel everywhere. Curiously, no one tries to stop him filming. Really disturbing stuff.

Pakistani Troops Execute POW’s in Swat Valley: Described on this site earlier today.

Criminals Chase Cops from a Banlieu in France: Recently posted on this site.

Cow Death Row: Previously posted on this site.

Plus lots of new translations into French, Spanish, Portuguese, Italian, Dutch, German, Korean, Norwegian, Tagalog, Polish, Finnish and Swedish.

Have fun sickos!


Filed under Accidents, Animals, Dead Bodies, Diving, Dogs, Domestic, Evil, Gross, Hangings, Humor, Idiots, Medical Procedures, Morbid, Murders, Ridiculousness, Sick, Sick and Evil, Sports, Translations

Mutual Intelligibility of Languages in the Slavic Family

A more updated version of this paper with working hyperlinks can be found on here.

There is much nonsense said about the mutual intelligibility of the various languages in the Slavic family. It’s often said that all Slavic languages are mutually intelligible with each other. This is simply not the case.

Method: It is important to note that the percentages are in general only for oral intelligiblity and only in the case of a situation of a pure inherent intelligiblity test. An inherent pure inherent intelligibility test would involve a a speaker of Slavic lect A listening to a tape or video of a speaker of Slavic Lect A.

Written intelligibility is often very different from written intelligibility in that in a number of cases, it tends to be higher, often much higher, than oral intelligibility. Written intelligibility was only calculated for a number of language pairs. Most pairs have no figure for written intelligibility.

A number of native speakers of various Slavic lects were interviewed about mutual intelligibility, language/dialect confusion, the state of their language, its history and so on. In addition, a Net search was done of forums where speakers of Slavic languages were discussing how much of other Slavic languages they understand. These figures were tallied up for each pair of languages to be tabulated and were then all averaged together. Hence the figures are averages taken from statements by native speakers of the languages in question.

Complaints have been made that many of these percentages were simply wild guesses with no science behind them. This is not the case as all figures were derived from estimates by native speakers themselves, often a number of estimates averaged together.

True science would involve scientific intelligibility testing of Slavic language pairs. There has been no scientific intelligiblity testing of any Slavic language pairs that I am aware of. Obviously, in order to answer these questions in a scientific manner, scientific intelligibility testing needs to be done. The problem is that most linguists are not interested in scientific intelligibility testing of language pairs.


Serbo-Croatian (Shtokavian) has 55% intelligibility of Macedonian (varies from 25-90%), 27% of Slovenian, 25% of Slovak, 20% of Ukrainian, 13% of oral Bulgarian and 25% of written Bulgarian, 10% of oral Russian and 22% of written Russian, 10% of Czech, and 5% of Polish.

Chakavian has 82% intelligibility of Kaikavian.

Kaikavian has 82% intelligibility of Chakavian.

Bulgarian has 80% intelligibility of Macedonian, 41% of Russian (varies from 7-75%) and 5% of Polish and Czech.

Macedonian has 65% oral and written intelligibility of Bulgarian.

Czech has 82% intelligibility of Slovak (varies from 70-95%), 12% of Polish and 5% of Russian and Bulgarian.

Polish has over 90% intelligibility of Sorbian, 22% of Silesian, 12% of Czech, 6% of Russian and 5% of Bulgarian.

Russian has 85% intelligibility of Rusyn, 74% of oral Belorussian and 85% of written Belorussian, 60% of Balachka, 50% of oral Ukrainian and 85% of written Ukrainian, 36% of oral Bulgarian (varies from 7-70%) and 80% of written Bulgarian, 38% of Polish, 30% of Slovak and oral Montenegrin and 50% of written Montenegrin, 12% of oral Serbo-Croatian and 25% of written Serbo-Croatian and 10% of Czech.

Belorussian has 80% intelligibility of Ukrainian and 55% of Polish.
Ukrainian has 82% intelligibility of Belorussian and Rusyn and 55% of Polish.

Slovak has 82% intelligibility of Czech (varies from 70-95%).

Eastern Slovak has 82% intelligibility of Rusyn and 72% of Ukrainian.
Saris Slovak has 85% intelligibility of Polish.


Reactions: So far there have been few reactions to the paper. However, a Croatian linguist has helped me write part of the Croatian section, and he felt that at least that part of the paper was accurate. A Serbian native speaker felt that the percentages for South Slavic seemed to be accurate.

A professor of Slavic Linguistics at a university in Bulgaria reviewed the paper and felt that the percentages were accurate. He was a member of a group of linguists who met periodically to discuss the field. He printed out the paper and showed it to his colleagues at the next meeting, and they spent some time discussing it. No professional linguist has yet to discount the percentages in this paper. The paper seems to have gone over well in the scientific linguistic community.

Now onto the discussion.

There is much nonsense floating around about Serbo-Croatian or Shtokavian. The main Shtokavian dialects of Croatian, Serbian, Montenegrin and Bosnian are mutually intelligible.

However, the Croatian macrolanguage has strange lects that Standard Croatian (Štokavian) cannot understand.

For instance, Čakavian Croatian is not intelligible with Standard Croatian. It consists of at least four major dialects, Ekavian Chakavian, spoken on the Istrian Peninsula, Ikavian Chakavian, spoken in southwestern Istria, the islands of Brač, Hvar, Vis, Korčula, and Šolta, the Pelješac Peninsula, the Dalmatian coast at Zadar and the outskirts of Split, and inland at Gacka, Middle Chakavian, which is Ikavian-Ekavian transitional, and Ijekavian Chakavian, spoken at the far southern end of the Chakavian language area on Lastovo Island, Janjina on the Pelješac Peninsula, and Bigova in the far south near the border with Montenegro.

Ekavian Chakavian has two branches – Buzet and Northern Chakavian. Buzet is actually transitional between Slovenian and Kaikavian. It was formerly thought to be a Slovenian dialect, but some now think it is more properly a Kaikavian dialect. There are some dialects around Buzet that seem to be the remains of old Kaikavian-Chakavian transitional dialects (Jembrigh 2014).

Ikavian Chakavian has two branches – Southwestern Istrian and Southern Chakavian. The latter is heavily mixed with Shtokavian.

Some reports say there is difficult intelligibility between Ekavian Chakavian in the north and Ikavian Chakavian in the far south, but speakers of Labin Ekavian in the far north say they can understand the Southeastern Istrian speech of the southern islands very well (Jembrigh 2014).

Čakavian differs from the other nearby Slavic lects spoken in the country due to the presence of many Italian words.

Chakavian actually has a written heritage, but it was mostly written down long ago. Writing in Chakavian started very early in the Middle Ages and began to slow down in the 1500’s when writing in Kaikavian began to rise. However, Chakavian magazines are published even today (Jembrigh 2014).

Although Chakavian is clearly a separate language from Shtokavian Croatian, in Croatia it is said that there is only one Croatian language, and that is Shtokavian Croatian. The idea is that the Kaikavian and Chakavian languages simply do not exist, though obviously they are both separate languages. Recently a Croatian linguist forwarded a proposal to formally recognize Chakavian as a separate language, but the famous Croatian Slavicist Radoslav Katičić argued with him about this and rejected the proposal on political, not linguistic grounds. This debate occurred only in Croatian linguistic circles and the public knows nothing about it (Jembrigh 2014).

Kaikavian Croatian, spoken in northwest Croatia and similar to Slovenian, is not intelligible with Standard Croatian.

Kaikavian is fairly uniform across its speech area, whereas Chakavian is more diverse (Jembrigh 2014).

In the 1500’s, Kaikavian began to be developed in a standard literary form. From the 1500’s to 1900, a large corpus of Kaikavian literature was written. Kaikavian was removed from public use after 1900, hence writing in the standard Kaikavian literary language was curtailed. Nevertheless, writing continues in various Kaikavian dialects, which still retain some connection to the old literary language, although some lexicon and grammar are going out (Jembrigh 2014).

However, Chakavian and Kaikavian have high, but not full, mutual intelligibility. Intelligibility between the two is estimated at 82%. Most Croatian linguists recognized Kaikavian as a separate language. However, any suggestions that Kaikavian is a separate language are censored on Croatian TV (Jembrigh 2014).

Nevertheless, the ISO has recently accepted a proposal from the Kaikavian Renaissance Association to list the Kaikavian literary language written from the 1500’s-1900 as a recognized language with an ISO code of kjv. The literary language itself is no longer written, but works written in it are still used in public for instance in dramas and church masses (Jembrigh 2014). This is heartening, although honestly, Kaikavian as an existing spoken lect also needs to be recognized as a living language instead of a dialect of “Croatian,” whatever that word means.

Furthermore, there is a dialect continuum between Kaikavian and Chakavian as there is between Kaikavian and Slovenian, and lects with a dialect continuum between them are always separate languages. There is an old Kaikavian-Chakavian dialect continuum of which little remains, although some of the old Kaikavian-Chakavian transitional dialects are still spoken (Jembrigh 2014).
Kaikavian differs from the other Slavic lects spoken in Croatia in that is has many Hungarian and German loans (Jembrigh 2014). Kaikavian is probably closer to Slovenian than it is to Chakavian.

Nevertheless, although intelligibility with Slovenian is high, Kaikavian lacks full intelligibility with Slovenian. Yet there is a dialect continuum between Slovenian and Kaikavian. Kaikavian, especially the Zagorje Kaikavian dialect around Zagreb, is close to the Shtajerska dialect of Slovene. However, leaving aside Kaikavian speakers, Croatians have poor intelligibility of Slovenian.

Molise Croatian is a Croatian language spoken in a few towns in Italy, such as Acquaviva Collecroce and two other towns. A different dialect is spoken in each town. Despite a lot of commonality between the dialects, the differences between them are significant. Intelligibility issues are not known. A koine is currently under development. The Croatians left Croatia and came to Italy from 1400-1500. The base of Molise Croatian was Shtokavian with an Ikavian accent and a heavy Chakavian base similar to what is now spoken as Southern Kaikavian Ikavian on the islands of Croatia. Molise Croatian is not intelligible with Standard Croatian.

Burgenland Croatian, spoken in Austria, is intelligible to Croatian speakers in Austria, Czech Republic, Slovakia and Hungary, but it has poor intelligibility with the Croatian spoken in Croatia.

Therefore, for the moment, there are five separate Croatian languages: Shtokavian Croatian, Kaikavian Croatian, Chakavian Croatian, Molise Croatian and Burgenland Croatian.

Serbian is a macrolanguage made up to two languages: Shtokavian Serbian and Torlak or Gorlak Serbian.

Shtokavian is simply the same Serbo-Croatian language that is also spoken in Croatia, Montenegro and Bosnia. It forms a single tongue and not separate languages as many insist. The claim for separate languages is based more on politics than on linguistic science.

Torlak Serbian is spoken in the south and southwest of Serbia and is transitional to Macedonian. It is not intelligible with Shtokavian, although this is controversial.

Torlakians are often said to speak Bulgarian, but this is not exactly the case. More properly, their speech is best seen as closer to Macedonian than to Bulgarian or Serbo-Croatian. The Serbo-Croatian vocabulary in both Macedonian and Torlakian is very similar, stemming from the political changes of 1912; whereas these words have changed more in Bulgarian.

The Torlakian spoken in the southwest is different. It is not really either Bulgarian or Serbo-Croatian but instead it is best said that they are speaking a mixed Bulgarian-Serbo-Croatian language. In the towns of Pirot and Vranje, it cannot be said that they speak Serbo-Croatian; instead they speak this Bulgarian-Serbo-Croatian mixed speech.

It’s also said that Serbo-Croatian can understand Bulgarian and Macedonian, but this is not true. However, the Torlak Serbians can understand both Macedonian and Bulgarian well, as this is a Serbo-Croatian dialect transitional to both languages. Intelligibility figures for Torlakian and Macedonian/Bulgarian are not known.

Intelligibility in the Slavic languages of the Balkans is much exaggerated.

Slovenian speakers find it hard to understand most of the other Yugoslavian lects except for Kaikavian Croatian. Serbo-Croatian intelligibility of Slovenian is 25-30%.

A lect called Čičarija Slovenian is spoken on the Istrian Peninsula in Slovenia just north of Croatia. This is a Chakavian-Slovenian transitional lect that is hard to categorize, but it is usually considered to be a Slovenian dialect.

Bulgarian and Macedonian can understand each other to a great degree (65-80%), but not completely. However, the Ser-Drama-Lagadin-Nevrokop dialect in northeastern Greece and southern Bulgaria and the Maleševo-Pirin dialect in eastern Macedonia and western Bulgaria are transitional between Bulgarian and Macedonian. The Aegean Macedonian dialects mostly spoken in Greece, such as the Lerinsko-Kostursko and Solunsko-Vodensko dialects, sound more Bulgarian than Macedonian.

Russian has a decent intelligibility with Bulgarian, possibly on the order of 50% (varies from 7-75%) but Bulgarian intelligibility of Russian seems lower. Nevertheless, Bulgarian-Russian intelligibility seems much exaggerated. Some Russians and Bulgarians say they understand almost nothing of the other language. Nevertheless, most Bulgarians over the age of 30-35 understand Russian well since studying Russian was mandatory under Communism.

However, Bulgarian-Russian written intelligibility is much higher. Bulgarian and Russian are close because the Ottoman rulers of Bulgaria would not allow printing in Bulgaria. Hence, many religious books were imported from Russia, and these books influenced Bulgarian. Russian influence only ended in 1878.

Serbo-Croatian and Bulgarian have 10-15% oral intelligibility, however, there are Bulgarian dialects that are transitional with Torlak Serbian. Written intelligibility is higher at 25%. Macedonian and Bulgarian would be much closer together except that in recent years, Macedonian has been heavily influenced by Serbo-Croatian, and Bulgarian has been heavily influenced by Russian.

This difference is because Bulgarian is not spoken the same way it is written like Serbo-Croatian is. However, Bulgarians claim to be able to understand Serbo-Croatian better than the other way around. There is a group of Bulgarians living in Serbia in the areas of Basilograd and Dimotrovograd who speak a Bulgarian-Serbian transitional dialect, and Serbs are able to understand these Bulgarians well.

Serbo-Croatian has variable intelligibility of Macedonian, averaging ~55%, while Nis Serbians have ~90% intelligibility with Macedonian. Part of the problem between Serbo-Croatian and Macedonian is that so many of the basic words – be, do, this, that, where – are different, however, much of the rest of the vocabulary is the same. Serbo-Croatian speakers can often learn to understand Macedonian well after some exposure.

Most Macedonians already are able to speak Serbo-Croatian well. This gives rise to claims of Macedonians being able to understand Serbo-Croatian very well, however, much of this may be due to bilingual learning. In fact, many Macedonians are switching away from the Macedonian language towards Serbo-Croatian.

The Macedonian spoken near the Serbian border is heavily influenced by Serbo-Croatian and is quite a bit different from the Macedonian spoken towards the center of Macedonia. One way to look at Macedonian is that it is a Serbo-Croatian-Bulgarian transitional lect. The intelligibility of Serbo-Croatian and Macedonian is highly controversial, and intelligibility studies are in order.
Croats say Macedonian is a complete mystery to them.

Czech and Polish are incomprehensible to Serbo-Croatian (Czech 10%, Polish 5%), but Serbo-Croatian has some limited comprehension of Slovak, on the order of 25%.

Serbo-Croatian and Russian have 10-15% intelligibility, if that, yet written intelligibility is higher at 25%.

Serbo-Croatian has only 20% intelligibility of Ukrainian.

Slovenians have a very hard time understanding Poles and Czechs and vice versa.

It’s often said that Czechs and Poles can understand each other, but this is not so. Much of the claimed intelligibility is simply bilingual learning. Czechs claim only 10-15% intelligibility of Polish.

The intelligibility of Polish and Russian is very low, on the order of 5-10%.
Polish is not intelligible with Kashubian, a language related to Polish spoken in the north of Poland, but figures are not known. Kashubian itself is a macrolanguage made up of two different languages, South Kashubian and North Kashubian, as the two have difficult intelligibility.

Silesian or Upper Silesian is also a separate language spoken in Poland, often thought to be halfway between Polish and Czech. It may have been split from Polish for up to 800 years, where it underwent heavy German influence. Polish lacks full intelligibility of Silesian, although this is controversial (see below). Some Poles say they find Silesian harder to understand than Belorussian or Slovak, which implies intelligibility of 20-25%.

The more German the Silesian dialect is, the harder it is for Poles to understand. In recent years, many of the German words are falling out of use and being replaced by Polish words, especially by young people. Poles who know German and Old Polish can understand Silesian quite well due to the Germanisms and the presence of many older Polish words, but Poles who speak only Polish have a hard time with Silesian.

Many Poles insist that Silesian is a Polish dialect, but this is based more on politics than reality. In fact, people in the north of Poland regard Silesian as incomprehensible. 40% of Silesian vocabulary is different from Polish, mostly Germanisms. The German influence is more prominent in the west; Polish influence is greater in the east. Many Silesian speakers now speak a watered down version of Silesian which is more properly seen as a Polish dialect with some Silesian words. Pure Silesian appears to be a dying language.

Silesian itself appears to be a macrolanguage as it is more than one language since as Opole Silesian speakers cannot understand Katowice Silesian, so Opole Silesian and Katowice Silesian are two different languages.

Cieszyn Silesian or Ponaszymu is a language closely related to Silesian spoken in Czechoslovakia in the far northeast of the country near the Polish and Slovak borders. It differs from the rest of Silesian in that it has undergone heavy Czech influence. Some say it is a part of Czech, but more likely it is a part of Polish like Silesian.

People observing conversation between Cieszyn Silesian and Upper Silesian report that they have a hard time understanding each other. Cieszyn Silesian speakers strongly reject the notion that they speak the same language as Upper Silesians. Ponaszymu also has many Germanisms which have been falling out of use lately, replaced by their Czech equivalents. Ponaszymu appears to lack full intelligibility with Czech. In fact, some say the intelligibility between the two is near zero.

Lach is a Czech-Polish transitional lect with a close relationship with Cieszyn Silesian. However, it appears to be a separate language, as Lach is not even intelligible within itself. Instead Eastern Lach and Western Lach have difficult intelligiblity and are separate languages so Lach itself is a macrolanguage. Lach is not fully intelligible with Czech; indeed, the differences between Lach and Czech appear to be greater than the differences between Silesian and Polish, despite the fact that Lach has been heavily leveling into Moravian Czech for the last 100 years.

Czechs say Lach is a part of Czech, and Poles say Lach is a part of Polish. The standard view among linguists seems to be that Lach is a part of Czech. However, another view is that Lach is indeed Lechitic, albeit with strong Czech influence.

Polish has excellent intelligibility of Upper Sorbian and Lower Sorbian, possibly over 90%. Furthermore, Upper and Lower Sorbian have over 90% intelligibility of each other, so instead of being two different languages, they are dialects of a single tongue, Sorbian.

It is often said that Ukrainian and Russian are intelligible with each other or even that they are the same language (a view perpetuated by Russian nationalists). It is not true at all that Ukrainian and Russian are mutually intelligible, as Russian only has 50% intelligibility of Ukrainian. For example, all Russian shows get subtitles on Ukrainian TV. Yet some say that the subtitles are simply put on as a political move due to Ukraine’s puristic language policy. Ukrainian and Russian only have 60% lexical similarity. Polish and Ukrainian have higher lexical similarity at 72%, and Ukrainian intelligibility of Polish is ~50%+.

However, there are dialects in between Ukrainian and Russian such as the Eastern Polissian and Slobozhan dialects of Ukrainian that are intelligible with both languages. Complicating the picture is the fact that many Ukrainians are bilingual and speak Russian also. Ukrainians can understand Russian much better than the other way around. Nevertheless Ukrainian intelligibility of Russian is hard to calculate because presently there are few Ukrainians in Ukraine who do not speak Russian. Most of the Ukrainian speakers who do not speak Russian are in Canada at the moment.

In addition, the Slobozhan dialects of Ukrainian and Russian such as (Slobozhan Ukrainian and Slobozhan Russian) spoken in the Kantemirovka (Voronezhskaya Oblast, Russia), and Kuban Russian or Balachka spoken in the Kuban area right over the eastern border of Ukraine are very close to each other. Slobozhan Russian can also be called Kuban Russian or Balachka.

It is best seen as a Ukrainian dialect spoken in Russia – specifically, it is markedly similar to the Poltavian dialect of Ukrainian spoken in Poltava in Central Ukraine. Although the standard view is that Balachka is a Ukrainian dialect, some linguists say that it is actually a separate language closely related to Ukrainian. An academic paper has been published making the case for a separate Balachka language. In addition, Balachka language associations believe it is a separate language. Intelligibility between Balachka and Ukrainian is not known. Russian only has 60% intelligibility of Balachka.

However, Balachka is dying out and is now spoken only by a few old people. Most people in the region speak Russian with a few Ukrainian words.

Slobozhan Russian is very close to Ukrainian, closer to Ukrainian than it is to Russian, and Slobozhan Ukrainian is very close to Russian, closer to Russian than to Ukrainian. Slobozhan Ukrainian speakers in this region find it easier to understand their Russian neighbors than the Upper Dniestrian Ukrainian spoken in the far west in the countryside around Lvov. Upper Dniestrian is influenced by German and Polish.

The Russian language in the Ukraine has been declining recently mostly because since independence, the authorities have striven to make the new Ukrainian as far away from Russian as possible by adopting  the Kharkiv Standard adopted in 1927 and jettisoning the 1932 Standard which brought Ukrainian more in line with Russian. For instance, in 1932, Ukrainian g was eliminated from the alphabet in order to make Ukrainian h correspond perfectly with Russian g. After 1991, the g returned to Ukrainian. Hence, Russians understand the colloquial Ukrainian spoken in the countryside pretty well, but they understand the modern standard heard on TV much less. This is because colloquial Ukrainian is closer to the Ukrainian spoken in the Soviet era, which had huge Russian influence.

The intelligibility of Belorussian with both Ukrainian and Russian is a source of controversy. On the one hand, Belorussian has some dialects that are intelligible with some dialects of both Russian and Ukrainian. For instance, West Palesian is a transitional Belarussian dialect to Ukrainian. Some say that West Palesian is actually a separate language, but the majority of Belarussian linguists say it is a dialect of Belarussian (Mezentseva 2014). Belarussian and Ukrainian have 85% similar vocabulary.

Nevertheless, Russian has high intelligibility of Belarussian, on the order of 75%. Belarussian is nonetheless a separate language from both Ukrainian and Russian.

From some reason, the Hutsul, Lemko, Boiko dialects of the Rusyn language are much more comprehensible to Russians than Standard Ukrainian is. Intelligibility may be 85%. Rusyn-Ukrainian intelligibility is described as similar to Czech-Slovak intelligibility – therefore, the intelligibility between Rusyn and Ukrainian is around 82%.

Rusyn-Ukrainian intelligibility is said to be the same as Ukrainian-Belorussian intelligibility, so Ukrainian and Belarussian also have ~82% intelligibility. At least the Lemko dialect of Rusyn has only marginal intelligibility with Ukrainian. Lemko is spoken heavily in Poland, and it differs from Standard Rusyn in that it has a lot of Polish vocabulary, whereas Standard Rusyn has more influences from Hungarian and Romanian.

The Rusyn language is composed of 50% Slovak roots and 50% Ukrainian roots, so some difficult intelligibility with Ukrainian might be expected. It has also been described as a transitional dialect between Polish and Slovak. Eastern Slovak has ~80% intelligibility of Rusyn.

Pannonian Rusyn is spoken by a group of Rusyns who migrated to northwestern Serbia (the Bachka region in Vojvodina province) and Eastern Croatia from eastern Slovakia and western Ukraine 250 years ago. Pannonian Rusyn is actually a part of Slovak, and Rusyn proper is really a part of Ukrainian. Pannonian Rusyn lacks full intelligibility of Rusyn proper. Not only that, but it is not even fully intelligible with the Eastern Slovak that it resembles most.

The intelligibility of Czech and Slovak is much exaggerated. It is true that Western Slovak dialects can understand Czech well, but Central Slovak, Eastern Slovak and Extraslovakian Slovak dialects cannot.

It is also said that West Slovak (Bratislava) cannot understand East Slovak, so Slovak may actually two different languages, but this is controversial. Western Slovak speakers say Eastern Slovak sounds idiotic and ridiculous, and some words are different, but other than that, they can basically understand it. Other Western Slovak speakers (Bratislava) say that Eastern Slovak (Kosice) is hard to understand. Bratislava speakers say that Kosice speech sounds 1/2 Slovak and 1/2 Ukrainian and uses many odd and unfamiliar words. Intelligibility testing between East and West Slovak would seem to be in order.

Much of the claimed intelligibility between Czech and Slovak was simply bilingual learning. Since the breakup, young Czechs and Slovaks understand each other worse since they have less contact with each other. In the former Czechoslovakia, everything was 50-50 bilingual – media, literature, etc. Since then, Slovak has been disappearing from the Czech Republic, so the younger people don’t understand Slovak so well.

Intelligibility of Czech and Slovak is 82% and varies from 70-95% depending on the dialect. Intelligibility problems are mostly on the Czech end because they don’t bother to learn Slovak while many Slovaks learn Czech. There is as much Czech literature and media as Slovak literature and media in Slovakia, and many Slovaks study at Czech universities. When there, they have to pass a language test. Czechs hardly ever study at Slovak universities.

Czechs see Slovaks as country bumpkins – backwards and folksy but optimistic, outgoing and friendly. Czechs are more urbane. The written languages differ much more than the spoken ones.

The languages really split about 1,000 years ago, but written Slovak was based on written Czech, and there was a lot of interlingual communication. A Moravian Czech speaker (Eastern Czech) and a Bratislavan Slovak (Western Slovak) speaker understand each other very well. They are essentially speaking the same language.

However, in recent years, there has also been quite a bit of bilingual learning. Young Czechs and Slovaks talk to each other a lot via the Internet. There are also some TV shows that show Czech and Slovak contestants untranslated (like in Sweden where Norwegian comics perform untranslated), and most people seem to understand these shows.

All foreign movies in both the Czech Republic and Slovakia are translated into Czech, not Slovak. Far Northeastern Slovak (Saris Slovak) near the Polish border is close to Polish and Ukrainian. Intelligibility data for Saris Slovak and Ukrainian is not known. Saris Slovak has high but not complete intelligibility of Polish, possibly 85%. Eastern Slovak may have 72% intelligibility of Ukrainian.

Southern Slovak on the Hungarian border has a harder time understanding Polish because they do not hear it much. This implies that some of the high intelligibility between Slovak and Polish may be due to bilingual learning on the part of Slovaks.

Russian has low intelligibility with Czech and Slovak, maybe 30%.


Jembrigh, Mario. Croatian linguist. December 2014. Personal communication.

Mezentseva, Inna. English professor. Vitebsk State University. Vitebsk, Belarus. December 2014. Personal communication.

If you think this website is valuable to you, please consider a contribution to support the continuation of the site. Donations are the only thing that keep the site operating.


Filed under Applied, Balto-Slavic, Balto-Slavic-Germanic, Bulgarian language, Comparitive, Czech, Dialectology, Indo-European, Indo-Hittite, Language Classification, Language Families, Language Learning, Linguistics, Multilingualism, Polish, Russian, Serbo-Croatian, Slavic, Slovak, Sociolinguistics

More On The Hardest Languages To Learn – Non-Indo-European Languages

Caution: This post is very long. It runs to 200 pages on the Net. Updated January 17, 2016.

This is a continuation of the earlier post. I split it up into two parts because it had gotten too long.

The post refers to which languages are the hardest for English speakers to learn, though to some extent, the ratings are applicable across languages. Most Chinese speakers would recognize Spanish as being an easy language, despite its alien nature. And even most Chinese, Navajo, Poles or Czechs acknowledge that their languages are hard to learn. To a certain extent, difficulty is independent of linguistic starting point. Some languages are just harder than others, and that’s all there is to it.

Method, Results and Conclusion. See here.

In this case, 73 non-IE languages were examined.

Ratings: Languages are rated 1-6, easiest to hardest. 1 = easiest, 2 = moderately easy to average, 3 = average to moderately difficult, 4 = very  difficult, 5 = extremely difficult, 6 = most difficult of all.

Time needed: Time needed to learn the language “reasonably well”: Level 1 languages = 3 months-1 year. Level 2 languages = 6 months-1 year. Level 3 languages = 1-2 years. Level 4 languages = 2 years. Level 5 languages = 3-4 years, but some may take longer.

Northeast Caucasian, Northwest Caucasian and Kartvelian

Of course the Caucasian languages like Tsez, Tabasaran, Georgian, Chechen, Ingush, Abkhaz and Circassian are some of the hardest languages on Earth to learn.

Chechen and Circassian are rated 6, hardest of all.

Northeast Caucasian

NE Caucasian languages have the uvulars and ejectives of Georgian in addition to pharyngeals, lateral fricatives, and other strangeness. They have noun classes like the Bantu languages (but usually fewer). Nevertheless, they have noun class agreement markers on verbs on adjectives. One thing NE Caucasian has is lots of case. Some languages have 40+ cases. They are built from the ground up via two forms – one a spatial form such as in, on or around and the other a directional motion form such as to, from, through or at.


Tsez has 64-126 different cases, making it by far the most complex case system on Earth! It is one of the few languages on Earth that has two genitive cases – Genitive 1 (-s) and Genitive 2 (-z). Genitive 1 is used when the genitive’s head noun is in absolutive case and Genitive 2 is used when the genitive’s head noun is in any other case. It also has four noun classes. It is said that even native speakers have a hard time picking up the correct inflection to use sometimes.

In Tsez, you need to know a lot Tsez grammar to communicate at a basic level. The sentence:

English: I like your mother.

Tsez: Дāьр деби энийу йетих. (Dǟr debi eniyu yetix.)

In order to speak that sentence in Tsez, you need to know:

• the words themselves (word order is not as important)
• that the verb -eti- requires the subject to be in the dative/lative case and the object to be in the absolutive
• the noun class for eniyu (class II)
• the dative/lative form of di (I), which is dǟr
• the genitive 1 form of mi (you), which is debi
• the congruence prefix y- that corresponds to the noun class of the absolutive argument of the phrase, in this case mother
• the present tense ending for vowel-final verbs -x

Tsez is rated 6, hardest of all.


Archi has an extremely complex phonology and one of the most complicated grammars on Earth. The extreme fusional aspects and the verbal morphology are what make the grammar so difficult. Every verb root has 1,502,839 possible forms! It is also an ergative language, but there is irregularity in its ergative system.

Some verbs take the typical ergative/absolutive case (absolutive for the subject of an intransitive very and ergative for the subject of a transitive verb – where the direct object would be in absolutive). In others the subject is in dative rather than the expected ergative/absolutive case. These are usually verbs of perception like love/want, hear, see, feel, and be bored. For instance, the verb:

-эти- = to love/want must have its subject in dative case instead of the expected absolutive or ergative case.

Among non-click languages, Archi has one of the largest consonant inventories, with only the extinct Ubykh having more. There are 26 vowels and between 76 and 82 consonants, depending on the analysis. Five of the six vowels can occur in five varieties: short, pharyngealized, high tone, long (with high tone), and pharyngealized with high tone.

It has many unusual phonemes, including contrasts between several voiceless velar lateral fricatives, voiceless and ejective velar lateral affricates and a voiced velar lateral fricative. The voiceless velar lateral fricative ʟ̝̊, the voiced velar lateral fricative ʟ̝, and the corresponding voiceless and ejective affricates k͡ʟ̝̊ and k͡ʟ̝̊ʼ are extremely unusual sounds, as velar fricatives are not typically laterals.

There are 15 cases, 10 regular cases, five spatial cases and five directional cases. The Spatial cases are Inessive (in), Intrative (between), superessive (above), Subessive (below) and Pertingent (against). The directional cases are Essive (as), Elative (out of), Lative (to/into), Allative (onto), Terminative (specifies a limit) and Translative (indicates change).

There are four noun classes:

I Male human
II Female human
III All insects, some animates, and some inanimates
IV Abstracts, some animates, and some inanimates that can only be seen via verbal agreement

Archi is rated 6, hardest of all.

Eastern Samur

Tabasaran is rated the 3rd most complex grammar in the world, with 48 different noun cases.

Tabasaran is rated 6, hardest of all.


Ingush has a very difficult phonology, an extremely complex grammar, and furthermore, is extremely irregular. Ingush also has a proximate/obviate distinction and is the only language in the region that has this feature. Ingush along with Chechen both have a closed class of verbs, an unusual feature in the world’s languages. New verbs are formed by adding a noun to the verb do:

shootdo gun

Ingush is rated 6, hardest of all.


One problem with Georgian is the strange alphabet: ქართულია ერთ ერთი რთული ენა. It also has lots of glottal stops that are hard for many foreigners to speak; consonant clusters can be huge – up to eight consonants stuck together (CCCCCCCCVC)- and many consonant sounds are strange. In addition, there are uvulars and ejectives. Georgian is one of the hardest languages on Earth to pronounce. It regularly makes it onto craziest phonologies lists.

Its grammar is exceedingly complex. Georgian is both highly agglutinative and highly irregular, which is the worst of two worlds. Other agglutinative languages such as Turkish and Finnish at least have the benefit of being highly regular. The verbs in particular seem nearly random with no pattern to them at all. The system of argument and tense marking on the verb is exceedingly complex, with tense, aspect, mood on the verb, person and number marking for the subject, and direct and indirect objects.

Although it is an ergative language, the ergative (or active-stative case marking as it is called) oddly enough is only used in the aorist and perfect tenses where the agent in the sentence receives a different case, while the aorist also masquerades as imperative. In the present, there is standard nominative-accusative marking. A single verb can have up to 12 different parts, similar to Polish, and there are six cases and six tenses.

Georgian also features something called polypersonal agreement, a highly complex type of morphological feature that is often associated with polysynthetic languages and to a lesser extent with ergativity.

In a polypersonal language, the verb has agreement morphemes attached to it dealing with one or more of the verbs arguments (usually up to four arguments). In a non polypersonal language like English, the verb either shows no agreement or agrees with only one of its arguments, usually the subject. Whereas in a polypersonal language, the verb agrees with one or more of the subject, the direct object, the indirect object, the beneficiary of the verb, etc. The polypersonal marking may be obligatory or optional.

In Georgian, the polypersonal morphemes appear as either suffixes or prefixes, depending on the verb class and the person, number, aspect and tense of the verb. The affixes also modify each other phonologically when they are next to each other. In the Georgian system, the polypersonal affixes convey subject, direct object, indirect object, genitive, locative and causative meanings.

g-mal-av-en = they hide you
= they hide it from you

mal (to hide) is the verb, and the other four forms are polypersonal affixes.

In the case below,

xelebi ga-m-i-tsiv-d-a = My hands got cold.

xelebi means hands. The m marker indicates genitive or my. With intransitive verbs, Georgian often omits my before the subject and instead puts the genitive onto the verb to indicate possession.

Georgian verbs of motion focus on deixis, whether the goal of the motion is towards the speaker or the hearer. You use a particle to signify who the motion is heading towards. If it heading towards neither of you, you use no deixis marker. You specify the path taken to reach the goal through the use or prefixes called preverbs, similar to “verbal case.” These come after the deixis marker:

up             a-
out            ga-
in             sha-
down into      cha-
across/through garda-
thither        mi-
away           c’a-
or down        da-


up towards me = amo-. The deixis marker is mo- and up is a-

On the plus side, Georgian has borrowed a great deal of Latinate foreign vocabulary, so that will help anyone coming from a Latinate or Latinate-heavy language background.

Georgian is rated 5, extremely difficult.

Northwest Caucasian

All NW Caucasian languages are characterized by a very small number of vowels (usually only two or three) combined with a vast consonant inventory, the largest consonant inventories on Earth. Almost any consonant can be plain, labialized or palatalized. This is apparently the result of an historical process whereby many vowels were lost and their various features became assigned to consonants. For instance, palatalized consonants may have come from Ci sequences and labialized consonants may have come from Cu sequences.

The grammars of these languages are complex. Unlike the NE Caucasian languages, they have simple noun systems, usually with only a handful of cases.

However, they have some of the complex verbal systems on Earth. These are some of the most synthetic languages in the Old World. Often the entire syntax of the sentence is contained within the verb. All verbs are marked with ergative, absolutive and direct object morphemes in addition to various applicative affixes.

These are akin to what some might call “verbal case.” For instance, in applicative voice systems, applicatives may take forms such as comitative, locative, instrumental, benefactive and malefactive. These roles are similar to the case system in nouns – even the names are the same. So you can see why some call this “verbal case.”

NW Caucasian verbs can be marked for aspect (whether something is momentous, continuous or habitual), mood (if something is certain, likely, desired, potential, or unreal). Other affixes can shape the verb in an adverbial sense, to express pity, excess or emphasis.

Like NE Caucasian, they are also ergative.

NW Caucasian makes it onto a lot of craziest language lists.

These are some of the strangest sounding languages on Earth. Of all of these languages, Abaza has the most consonants. Here is a video in the Abaza language.


Ubykh, a Caucasian language of Turkey, is now extinct, but there is one second language speaker, a linguist who is said to have taught himself the language. It has more consonants than any non-click language on Earth – 84 consonant sounds in all. Furthermore, the phonemic inventory allows some very strange consonant clusters.

Ubykh has many rare consonant sounds. is only also found in two of Ubykh’s relatives, Abkhaz and Abaza and in two other languages, both in the Brazilian Amazon. The pharyngealized labiodental voiced fricative  does not exist in any other language. It often makes it onto weirdest phonologies lists. Ubykh also got a very high score on a study of the weirdest languages on Earth.

Combine that with only two vowel sounds and a highly complex grammar, and you have one tough language.

In addition, Ubykh is both agglutinative and polysynthetic, ergative and has polypersonal agreement:

If only you had not been able to make him take it all out from under me again for them…

There are an incredible 16 morphemes in that nine syllable word.

Ubykh has only four case systems on its nouns, but much case function has shifted over to the verb via preverbs and determinants. It is these preverbs and determinants that make Ubykh monstrously complex. The following are some of the directional preverbs:

  • above and touching
  • above and not touching
  • below and touching
  • below and not touching
  • at the side of
  • through a space
  • through solid matter
  • on a flat horizontal surface
  • on a non-horizontal or vertical surface
  • in a homogeneous mass
  • towards
  • in an upward direction
  • in a downward direction
  • into a tubular space
  • into an enclosed space

There are also some preverbal forms that indicate deixis:

j-  = towards the speaker

Others can indicate ideas that would take up whole phrases in English:

jtɕʷʼaa- = on the Earth, in the Earth

ʁadja ajtɕʷʼaanaaɬqʼa
They buried his body.
(Lit. They put his body in the earth.)

faa– = out of, into or with regard to a fire.

Amdʒan zatʃətʃaqʲa faastχʷən.
I take a brand out of the fire.

Morphemes may be as small as a single phoneme:

They give you to him.

w – 2nd singular absolutive
a – 3rd singular dative
n – 3rd ergative
– to give
aa – ergative plural
n – present tense

Adverbial suffixes are attached to words to form meanings that are often formed by aspects or tenses in other languages:

asfəpχaI need to drink it.
I can drink it.
I drink it all the time.
I am drinking it all up.
I drink it too much.
I drink it again.

Nouns and verbs can transform into each other. Any noun can turn into a stative verb:


I was a child.
(Lit. I child-waschild-was is a verb – to be a child.)

By the same token, many verbs can become nouns via the use of a nominal affix:

qʼato say

what I say
– (Lit. That which I saymy speech, my words, my language, my orders, etc.

Number is marked on the verb via a verbal suffix and is only marked on the noun in the ergative case.

However, it does lack the convoluted case systems of the Caucasian languages next door and there is no grammatical gender.

Ubykh is rated 6, hardest of all.


Abkhaz is an extremely difficult language to learn. Each basic consonant has eight different positions of articulation in the mouth. Imagine how difficult that would be for an Abkhaz child with a speech impediment. Abkhaz seems to put agreement markers on just about everything in the language. Abkhaz makes it onto many craziest language lists, and it recently got a very high score on a weirdest language study.

Abkhaz is rated 6, hardest of all.


Burushaski is often thought to be a language isolate, related to no other languages, however, I think it is Dene-Caucasian. It is spoken in the Himalaya Mountains of far northern Pakistan in an area called the Hunza. It’s verb conjugation is complex, it has a lot of inflections, there are complicated ways of making sentences depending on many factors, and it is an ergative language, which is hard to learn for speakers of non-ergative languages. In addition, there are very few to no cognates for the vocabulary.

Burushaski is rated 6, hardest of all.

American Indian Languages

American Indian languages are also notoriously difficult, though few try to learn them in the US anyway. In the rest of the continent, they are still learned by millions in many different nations. You almost really need to learn these as a kid. It’s going to be quite hard for an adult to get full competence in them.

One problem with these languages is the multiplicity of verb forms. For instance, the standard paradigm for the overwhelming number of regular English verbs is a maximum of five forms:


Many Amerindian languages have over 1,000 forms of each verb in the language.


Yet the Salishans (see below) always considered the neighboring language Kootenai to be too hard to learn. Kootenai also has a distinction between proximate/obviate along with direct/inverse alignment, probably from contact with Algonquian.

However, the Kootenai direct/inverse system is less complex than Algonquian’s, as it is present only in the 3rd person. Kootenai also has a very strange feature in that they have particles that look like subject pronouns, but these go outside of the full noun phrase. This is a very rare feature in the world’s languages. Kootenai scored very high on a weirdest language survey.

Kootenai is an isolate spoken in Idaho by 100 people.

Kootenai is rated 6, hardest of all.


Yuchi is a language isolate spoken in the Southern US. They were originally located in Eastern Tennessee and were part of the Creek Confederacy at one time. Yuchi is nearly extinct, with only five remaining speakers.

Yuchi has noun genders or classes based on three distinctions of position: standing, sitting or lying. All nouns are either standing, sitting or lying. Trees are standing, and rivers are lying, for instance. It it is taller than it is wide, it is standing. It if is  wider than it is tall, it is lying.

If it is about as about as wide as it is tall, it is sitting. All nouns are one of these three genders, but you can change the gender for humorous or poetic effect. A linguist once asked a group of female speakers whether a penis was standing, sitting or lying. After lots of giggles, they said the default was sitting, but you could say it was standing or lying for poetic effect.

Also all Yuchi pronouns must make a distinction between age (older or younger than the speaker) and ethnicity (Yuchi or non-Yuchi).

Yuchi gets a 6 rating, hardest of all.


Tlingit is probably one of the hardest, if not the hardest, language in the world. Tlingit is analyzed as partly synthetic, partly agglutinative, and sometimes polysynthetic. It has not only suffixes and prefixes, but it also has infixes or affixes in the middle of words.

‘eechto pick

All prefixes must be in proper order for the word to work.

I am usually picking, on purpose, a long object through the hole while standing on a table.

I am usually being forced to pick a long object through the hole while standing on a table.

I am usually being picking the edible long object through the hole while standing on a table.

Tlingit has a pretty unusual phonology. For one thing, it is the only language on Earth with no l. This despite the fact that it has five other laterals: dl (), tl (tɬʰ), tl’ (tɬʼ), l (ɬ) and l’ (ɬʼ). The tɬʼ and ɬʼ sounds are rare in the world’s languages. ɬʼ  is only found in the wild NW Caucasian languages. It also has two labialized glottal consonants, ʔʷ and hw ().

Tlingit gets a 6 rating, hardest of all.


Navajo has long, short and nasal vowels, a tone system and a grammar totally unlike anything in Indo-European. A stem of only four letters or so can take enough affixes to fill a whole line of text.

Navajo is a polysynthetic language. In polysynthetic languages, very long words can denote an entire sentence, and it’s quite hard to take the word apart into its parts and figure out exactly what they mean and how they go together. The long words are created because polysynthetic languages have an amazing amount of morphological richness. They put many morpheme together to create a word out of what might be a sentence in a non-polysynthetic language.

Some Navajo dictionaries have thousands of entries of verbs only, with no nouns. Many adjectives have no direct translation into Navajo. Instead, verbs are used as adjectives. A verb has no particular form like in English – to walk. Instead, it assumes various forms depending on whether or not the action is completed, incomplete, in progress, repeated, habitual, one time only, instantaneous, or simply desired. These are called aspects. Navajo must have one of the most complex aspect systems of any language:

The Primary aspects:

Momentaneous – punctually (takes place at one point in time)
Continuative – an indefinite span of time & movement with a specified direction
Durative – over an indefinite span of time, non-locomotive uninterrupted continuum
Repetitive – a continuum of repeated acts or connected series of acts
Conclusive – like durative but in perfective terminates with static sequel
Semelfactive – a single act in a repetitive series of acts
Distributive – a distributive manipulation of objects or performance of actions
Diversative – a movement distributed among things (similar to distributive)
Reversative – results in directional change
Conative – an attempted action
Transitional – a shift from one state to another
Cursive – progression in a line through time/space (only progressive mode)

The subaspects:

Completive – an event/action simply takes place (similar to the aorist tense)
Terminative – a stopping of an action
Stative – sequentially durative and static
Inceptive – beginning of an action
Terminal – an inherently terminal action
Prolongative – an arrested beginning or ending of an action
Seriative – an interconnected series of successive separate & distinct acts
Inchoative – a focus on the beginning of a non-locomotion action
Reversionary – a return to a previous state/location
Semeliterative – a single repetition of an event/action

The tense system is almost as wild as the aspectual system.

For instance, the verb ndideesh means to pick up or to lift up. But it varies depending on what you are picking up:

ndideeshtiilto pick up a slender stiff object (key, pole)
to pick up a slender flexible object (branch, rope)
to pick up a roundish or bulky object (bottle, rock)
to pick up a compact and heavy object (bundle, pack)
to pick up a non-compact or diffuse object (wool, hay)
to pick up something animate (child, dog)
to pick up a few small objects (a couple of berries, nuts)
to pick up a large number of small objects (a pile of berries, nuts)
to pick up something flexible and flat (blanket, piece of paper)
to pick up something I carry on my back
to pick up anything in a vessel
to pick up mushy matter (mud).

But picking up is only one way of handling the 12 different consistencies. One can also bring, take, hang up, keep, carry around, turn over, etc. objects. There are about 28 different verbs one can use for handling objects. If we multiply these verbs by the consistencies, there are over 300 different verbs used just for handling objects.

In Navajo textbooks, there are conjugation tables for inflecting words, but it’s pretty hard to find a pattern there. One of the most frustrating things about Navajo is that every little morpheme you add to a word seems to change everything else around it, even in both directions.

Navajo is said to have a very difficult system for counting numerals.

There is also a noun classifier system with more than a dozen classifiers that affect inflection. This is quite a few classifiers even for a noun classifier language and is similar to African languages like Zulu. In addition, it has the strange direct/inverse system.

To add insult to injury, Navajo is an ergative language.

Navajo also has an honorifics or politeness system similar to Japanese or Korean.

Navajo also has the odd feature where the word niinaabecause can be analyzed as a verb.

X áhóót’įįd biniinaa…
Because X happened…

Shiniinaa sits’il.
It broke into pieces because of me.

In the latter sentence, the only way we know that 1st singular was involved in because of the person marking on niinaa.

There are 25 different kinds of pronominal prefixes that can be piled onto one another before a verb base.

Navajo has a very strange feature called animacy, where nouns take certain verbs according to their rank in the hierarchy of animation which is a sort of a ranking based on how alive something is. Humans and lightning are at the top, children and large animals are next and abstractions are at the bottom.

All in all, Navajo, even compared to other polysynthetic languages, has some of the most incredibly complicated polysynthetic morphology of any language. On craziest grammar and craziest language lists, Navajo is typically listed.

It is even said that Navajo children have a hard time learning Navajo as compared to children learning other languages, but Navajo kids definitely learn the language. Similarly with Hopi below, even linguists find even the best Navajo grammars difficult or even impossible to understand.

However, Navajo is quite regular, a common feature in Amerindian languages.

Navajo is rated 6, hardest of all.


Slavey, a Na-Dene language of Canada, is hard to learn. It is similar to Navajo and Apache. Verbs take up to 15 different prefixes. All Athabascan languages have wild verbal systems. It also uses a completely different alphabet, a syllabic one designed for Canadian Indians.

Slavey is rated 6, hardest of all.


Haida is often thought to be a Na-Dene language, but proof of its status is lacking. If it is Na-Dene, it is the most distant member of the family. Haida is in the competition for the most complicated language on Earth, with 70 different suffixes.

Haida is rated 6, hardest of all.


The Salishan languages spoken in the Northwest have a long reputation for being hard to learn, in part because of long strings of consonants, in one case 11 consonants long. Salish languages are the only languages on Earth that allow words without sonorants.

Many of the vowels and consonants are not present in most of the world’s widely spoken languages. The Salish languages are, like Chukchi, polysynthetic. Some translations treat all Salish words are either verbs or phrases. Some say that Salish languages do not contain nouns, though this is controversial. The verbal system of Salish languages is absurdly complex.

All Salishan languages are rated rated 6, hardest of all.

Nuxálk (Bella Coola)

Nuxálk is a notoriously difficult Salishan Amerindian language spoken in British Colombia. It is famous for having some really wild words and even sentences that don’t seem to have any vowels in them at all. For instance:

xłp̓x̣ʷłtłpłłskʷc̓  (xɬpʼχʷɬtʰɬpʰɬːskʷʰt͡sʼ in IPA)
He had a bunchberry plant.

seal fat

Here are some more odd words and sentences:


Nuyamłamkis timantx tisyuttx ʔułtimnastx.
The father sang the song to his son.

Musis tiʔimmllkītx taq̓lsxʷt̓aχ.
The boy felt that rope.

However, this word is not typically used by speakers and by no means do most words consist of all consonants. The language sounds odd when spoken. It has been described as “whispering while chewing on a granola bar” (see the video sample under Montana Salish below).

These wild consonant clusters are even crazier than the ones in Ubykh and NW Caucasian. In fact, the nutty consonant clusters in Salish and causing a debate in linguistics about whether or not the syllable is even a universal phenomenon in language as some Salish words and phrases appear to lack syllables. Some Berber dialects have raised similar questions about the syllable.

Nuxálk makes it onto lists of the craziest phonologies on Earth.

Nuxálk is rated 6, hardest of all.

Interior Salish

Montana Salish is said to be just as hard to learn as Nuxálk . Spokane (Montana Salish) has combining and independent forms with the same meaning:


Montana Salish makes it onto a lot of craziest grammars lists.

This link shows an elder on the Flathead Indian Reservation in Montana, Steven Smallsalmon, speaking Montana Salish. He also leads classes in the language. This is probably one of the strangest sounding languages on Earth.

Montana Salish is rated 6, hardest of all.


Straits Salish has an aspectual distinction between persistent and nonpersistent. Persistent means the activity continues after its inception as a state. The persistent morpheme is . The result is similar to English:

figure out – nonpersistent
know – persistent

look at – nonpersistent
watch – persistent

take – nonpersistent
hold – persistent

is referred to as a “parasitic morpheme” and only occurs in stem that has an underlying ə which serves as a “host” for the morpheme.

How strange.

The Saanich dialect of Straits Salish is often listed in the rogue’s gallery of craziest grammars on Earth. The writing system is often listed as one of the worst out there. In addition, Saanich makes it onto craziest grammars lists for the parasitic morphemes and for having no distinction between nouns and verbs!

Straits Salish gets a 6 rating, hardest of all.

Halkomelem, spoken by 570 people around Vancouver, British Colombia, is widely considered to be one of the hardest languages on Earth to learn. In Halkomelem, many verbs have an orientation towards water. You can’t just say, She went home. You have say how she was going home in relation to nearby bodies of water. So depending on where she was walking home in relation to the nearest river, you would say:

She was farther away from the water and going home.
She was coming home in the direction away from the water.
She was walking parallel to the flow of the water downstream.
She was walking parallel to the flow of the water upstream.

Halkomelem gets a 6 rating, hardest of all.


Lushootseed is said to be just as hard to learn as Nuxálk. Lushootseed is one of the few languages on Earth that has no nasals at all, except in special registers like baby talk and the archaic speech of mythological figures. It also has laryngealized glides and nasals: w ̰ , m̥ ̰ , and n̥ ̰ .

Lushootseed is rated 6, hardest of all.


All Iroquoian languages are extremely difficult, but Athabaskan is probably even harder. Siouan languages may be equal to Iroquoian in difficulty.

Compare the same phrases in Tlingit (Athabaskan) and and  Cherokee (Iroquoian).


kutíkusa‘áatIt’s cold outside.
It’s cold right now.

In Tlingit, you can add or modify affixes at the beginning as prefixes, in the middle as infixes and at the end as suffixes. In the above example, you changed a part of the word within the clause itself.


doyáditlv uyvtlvIt is cold outside. (Lit. Outside it is cold)
ka uyvtlv It is cold now. (Lit. Now it is cold.)

As you can see, Cherokee is easier.


Cherokee is very hard to learn. In addition to everything else, it has a completely different alphabet. It’s polysynthetic, to make matters worse. It is possible to write a Cherokee sentence that somehow lacks a verb. There are five categories of verb classifiers. Verbs needing classifiers must use one. Each regular verb can have an incredible 21,262 inflected forms! All verbs contain a verb root, a pronominal prefix, a modal suffix and an aspect suffix. In addition, verbs inflect for singular, plural and also dual. For instance:

ᎠᎸᎢᎭ   a'lv'íha 

You have 126 different forms:
ᎬᏯᎸᎢᎭ  gvyalv'iha     I tie you up
ᏕᎬᏯᎸᎢᎭ degvyalviha  I'm tying you up
ᏥᏯᎸᎢᎭ  jiyalv'ha        I tie him up
ᎦᎸᎢᎭ                          I tie it
ᏍᏓᏯᎸᎢᎭ sdayalv'iha  I tie you (dual)
ᎢᏨᏯᎢᎭ  ijvyalv'iha    I tie you (pl)
ᎦᏥᏯᎸᎢᎭ gajiyalv'iha  I tie them (animate)
ᏕᎦᎸᎢᎭ                        I tie them up (inanimate)
ᏍᏆᎸᎢᎭ  squahlv'iha    You tie me
ᎯᏯᎸᎢᎭ  hiyalv'iha     You're tying him
ᎭᏢᎢᎭ   hatlv'iha         You tie it
ᏍᎩᎾᎸᎢᎭ skinalv'iha    You're tying me and him
ᎪᎩᎾᏢᎢᎭ goginatlv'iha  They tie me and him etc.

Let us look at another form:

to see

I see myself           gadagotia
I see you                gvgohtia
I see him/               tsigotia
I see it                    tsigotia
I see you two          advgotia
I see you (plural)    istvgotia
I see them (live)    gatsigotia
I see them (things) detsigotia

You see me                     sgigotia
You see yourself              hadagotia
You see him/her              higo(h)tia
You see it                        higotia
You see another and me  sginigotia
You see others and me    isgigotia
You see them (living)      dehigotia
You see them (living)      gahigotia
You see them (things)     detsigotia

He/she sees me                    agigotia
He/she sees you                   tsagotia
He/she sees you                   atsigotia
He/she sees him/her            agotia
He/she sees himself/herself  adagotia
He/she sees you + me          ginigotia
He/she sees you two             sdigotia
He/she sees another + me    oginigotia
He she sees us (them + me) otsigotia
He/she sees you (plural)       itsigotia
He/she sees them                 dagotia

You and I see him/her/it                igigotia
You and I see ourselves                 edadotia
You and I see one another             denadagotia/dosdadagotia
You and I see them (living)           genigotia
You and I see them (living or not) denigotia

You two see me                           sgninigotia
You two see him/her/it                 esdigotia
You two see yourselves                sdadagotia
You two see us (another and me) sginigotia
You two see them                        desdigotia

Another and I see you             sdvgotia
Another and I see him/her       osdigotia
Another and I see it                 osdigotia
Another and I see you-two      sdvgotia
Another and I see ourselves    dosdadagotia
Another and I see you (plural) itsvgotia
Another and I see them           dosdigotia

You (plural) see me        isgigoti
You (plural) see him/her etsigoti

They see me                    gvgigotia
They see you                   getsagotia
They see him/her             anigoti
They see you and me       geginigoti
They see you two             gesdigoti
They see another and me gegigotia/gogenigoti
They see you (plural)       getsigoti
They see them                 danagotia
They see themselves       anadagoti

I will see datsigoi
I saw      agigohvi

He/she will see dvgohi
He/she             sawugohvi

Number is marked for inclusive vs. exclusive and there is a dual. 3rd person plural is marked for animate/inanimate. Verbs take different object forms depending on if the object is solid/alive/indefinite shape/flexible. This is similar to the Navajo system.

Cherokee also has lexical tone, with complex rules about how tones may combine with each other. Tone is not marked in the orthography. The phonology is noted for somehow not having any labial consonants.

However, Cherokee is very regular. It has only three irregular verbs. It is just that there are many complex rules.

Cherokee is rated 5.5, close to most difficult of all.

Northern Iroquoian
Five Nations-Huronian-Susquehannock

Wyandot, a dormant language that has been extinct for about 50 years, has some unbelievably complex structures. Let us look at one of them. Wyandot is the only language on Earth that allows negative sentences that somehow do not contain a negative morpheme. Wyandot makes it onto craziest grammars lists. (To be continued).

Mississippi Valley-Ohio Valley Siouan
Mississippi Valley Siouan

Lakota and other Siouan languages may well be as convoluted as Iroquoian. In Lakota, all adjectives are expressed as verbs. Something similar is seen in Nahuatl.

Ógle sápe kiŋ mak’ú.
The shirt it is black he gave it to me.
He gave me the black shirt.

In the above, it is black is a stative verb and serves as an adjective.

Ógle kiŋ sabyá mak’ú.
Shirt the blackly he gave it to me.
He gave me the black shirt. (Lit. He gave me the shirt blackly.)

Bkackly is an adverb serving as an adjective above.

Lakota gets a 5.5 rating, hardest of all.


All Algonquian languages have distinctions between animate/inanimate nouns, in addition to having proximate/obviate and direct/inverse distinctions. However, most languages that have proximate/obviate and direct/inverse distinctions are not as difficult as Algonquian.

Proximate/obviative is a way of marking the 3rd person in discourse. It distinguishes between an important 3rd person (proximate) and a more peripheral 3rd person (obviative). Animate nouns and possessor nouns tend to be marked proximate while inanimate nouns and possessed nouns tend to be marked obviative.

Direct/inverse is a way of marking discourse in terms of saliency, topicality or animacy. Whether one noun ranks higher than another in terms of saliency, topicality or animacy means that that nouns ranks higher in terms of person hierarchy. It is used only in transitive clauses. When the subject has a higher ranking than the object, the direct form is used. When the object has a higher ranking than the object, the inverse form is used.

Central Algonquian

Cree is very hard to learn. It are written in a variety of different ways with different alphabets and syllabic systems, complicating matters even further. The syllabic alphabet has many problems and is often listed as one of the worst scripts out there. They are both polysynthetic and have long, short and nasal vowels and aspirated and unaspirated voiceless consonants. Words are divided into metrical feet, the rules for determining stress placement in words are quite complex and there is lots of irregularity. Vowels fall out a lot, or syncopate, within words.

Cree adds noun classifiers to the mix, and both nouns and verbs are marked as animate or inanimate. In addition, verbs are marked for transitive and intransitive. In addition, verbs get different affixes depending on whether they occur in main or subordinate clauses.

Cree is rated 6, hardest of all.


Ojibwa is said to be about as hard to learn as Cree as it is very similar.

Ojibwa is rated 6, hardest of all.

Plains Algonquian

Cheyenne is well-known for being a hard Amerindian language to learn. Like many polysynthetic languages, it can have very long words.

I truly don’t know Cheyenne very well.

However, Cheyenne is quite regular, but has so many complex rules that it is hard to figure them all out.

Cheyenne is rated 6, hardest of all.


Arapaho has a strange phonology. It lacks phonemic low vowels. The vowel system consists of i, ɨ~,u, ɛ, and ɔ, with no low phonemic vowels. Each vowel also has a corresponding long version. In addition, there are four diphthongs, ei, ou, oe and ie, several triphthongs, eii, oee, and ouu, as well as extended sequences of vowels such as eee with stress on either the first or the last vowel in the combination. Long vowels of various types are common:

I will turn out the lights.

It is raining.

There is a pitch accent system with normal, high and allophonic falling tones. Arapaho words also undergo some very wild sound changes.

Arapaho is rated 6, hardest of all.

Gros Ventre has a similar phonological system and similar elaborate sound changes as Arapaho.

Gros Ventre is rated 5, hardest of all.


Wichita has many strange phonological traits. It has only one nasal. Labials are rare and appear in only two roots. It also may have only three vowels, i, e, and a, with only height as a distinction. Such a restricted vertical vowel distribution is only found in NW Caucasian and the Papuan Ndu languages. There is apparently a three-way contrast in vowel length – regular, long and extra-long.

This is only found in Mixe and Estonian. There are some interesting tenses. Perfect tense means that an act has been carried out. The strange intentive tense means that one hopes or hoped to to carry out an act. The habitual tense means one regularly engages in the activity, not that one is doing so at the moment.

Long consonant clusters are permitted.


while sleeping

There are many cases where a CVɁ sequence has been reduced to due to loss of the vowel, resulting in odd words such as:


Word order is ordered in accordance with novelty or importance.

hira:wisɁiha:s kiyari:ce:hire:
Our ancestors God put us on this Earth.

weɁe hira:rɁ tiɁi na:kirih
God put our ancestors on this Earth.

In the sentence above, “our ancestors” is actually the subject, so it makes sense that it comes first.

Wichita has inclusive and exclusive 3rd person plural and has singular, dual and plural. There is an evidential system where if you say you know something, you must say how you know it – whether it is personal knowledge or hearsay.

Wichita gets a 6 rating, hardest of all.

Coastal Chantal

Huamelutec or Lowland Oaxaca Chantal has the odd glottalized fricatives , , ɬʼ and as its only glottalized consonants. They alternate with plain f, s, l and x. , ɬʼ and are extremely rare in the world’s languages, usually only found in 2-3 other languages, often in NW Caucasian. occurs only in one other language – Tlingit. is slightly more common, occurring five other languages including Tlingit. In other languages, these odd sounds derived from sequences of consonant + q: Cq -> Cʔ -> glottalized fricative.

Sentence structure is odd:

Hit the ball the man.
Hit the man the ball.
The man hit the ball.

All mean the same thing.

Huamelutec gets a 6 rating, hardest of all.


Karok is a language isolate spoken by a few dozen people in northern California. The last native speaker recently died, however, there are ~80 who have varying levels of L2 fluency.

In Karok, you can use a suffix for different types of containment – fire, water or a solid.

throw into a fire

throw into water

throw through a solid

The suffixes are unrelated to the words for fire, water and solid.

Karok gets a 5 rating, hardest of all.


Hopi is so difficult that even grammars describing the language are almost impossible to understand. For instance, Hopi has two different words for and depending on whether the noun phrase containing the word and is nominative or accusative.

Hopi is rated 6, hardest of all.

Southern Uto-Aztecan
Core Nahua

In Nahuatl, most adjectives are simply stative verbs. Hence:

Umntu omde waya eTenochtitlan.
The man he is tall went to Tenochtitlan.
The tall man went to Tenochtitlan.

He is tall is a stative verb in the above.

Nahuatl gets a 6 rating, hardest of all.

Central Numic

Comanche is legendary for being one of the hardest Indian languages of all to learn. Reasons are unknown, but all Amerindian languages are quite difficult. I doubt if Comanche is harder than other Numic languages.

Bizarrely enough, Comanche has very strange sounds called voiceless vowels, which seems to be an oxymoron, as vowels would seem to be inherently voiced. English has something akin to voiceless vowels in the words particular and peculiar, where the bolded vowels act something akin to a voiceless vowel.

Comanche was used for a while by the codespeakers in World War 2 – not all codespeakers were Navajos. Comanche was specifically chosen because it was hard to figure out. The Japanese were never able to break the Comanche code.

Comanche is rated 6, hardest of all.

Western Oto-Mangue

Chinantec, an Indian language of southwest Mexico, is very hard for non-Chinantecs to learn. The tone system is maddeningly complex, and the syntax and morphology are very intricate.

Chinantec is rated 6, hardest of all.

Lowland Valley

Jalapa Mazatec has distinctions between modal, creaky, breathy-voiced vowels along with nasal versions of those three. It also has creaky consonants and voiceless nasals. It has three tones, low, mid and high. Combining the tones results in various contour tones. In addition, it has a 3-way distinction in vowel length. Whistled speech is also possible. It has a phonemic distinction between “ballistic” and “controlled” syllables which is only present on Oto-Manguean.

Ballistic (short)
you plural

Controlled (half-long)
– six

Jalapa Mazatec is rated 6, hardest of all.

Upper Amazon
Eastern Nawiki

Tariana is a very difficult language mostly because of the unbelievable amount of information it crams into its morphology and syntax. This is mostly because it is an Arawakan language that has been heavily influenced by neighboring Tucanoan languages, with the result that it has many of the grammatical categories and particles present in both families.

This stems from the widespread bilingualism in the Vaupes Basin of Colombia, where many people grow up bilingual from childhood and often become multilingual by adulthood. Learning up to five different languages is common. Code-switching was frowned upon and anyone using a word from Language Y while speaking Language X would get laughed at. Hence the various languages tended to borrow features from each other quite easily.

For instance, Tariana has both a noun classifier system and a gender system. Noun classifiers and gender are sometimes subsumed under the single category of “noun classifiers.” Yet Tariana has both, presumably from its relationship to two completely different language families. So in Tariana is not unusual to get both demonstratives and verbs marked for both gender and noun classifier. Tariana borrowed such things as serialized perception verbs and the dubitative marker from Tucano.

In addition, Tariana has some very odd sounds, including aspirated nasals mh (), nh (n̺ʰ) and ñh (ɲʰ) and an aspirated w () of all things. They seem to be actually aspirated, not just partially devoiced as many voiceless nasals and liquids are.

Tariana gets 6, hardest of all.


Bora, a Wintotoan language spoken in Peru and Colombia near the border between the two countries, has a mind-boggling 350 different noun classes. The noun classifier system is actually highly productive and is often used to create new nouns. New nouns can be created very easily, and their meanings are often semantically transparent. In some noun classifier systems, classifiers can be stacked one upon the other. In these cases, typically the last one is used for agreement purposes.

Bora also is a tonal language, but it has only two tones. In addition, nearly all consonantal phonemes have phonemic aspirated and palatalized counterparts. The agreement structure in the language is also quite convoluted. The classifier system effectively replaces much derivational morphology on the noun and noun compounding processes that other languages use to expand the meanings of nominals.

Bora gets a 6 rating, hardest of all.

Eastern Tucanoan

Tuyuca is a Tucanoan language spoken in by 450 people in the department of Vaupés in Colombia. An article in The Economist magazine concluded that it was the hardest language on Earth to learn.

It has a simple sound system, but it’s agglutinative, and agglutinative languages are pretty hard. For instance, hóabãsiriga means I don’t know how to write. It has two forms of 1st person plural, I and you (inclusive) and I and the others (exclusive). It has between 50-140 noun classes, including strange ones like bark that does not cling closely to a tree, which can be extended to mean baggy trousers or wet plywood that has begun to fall apart.

Like Yamana, a nearly extinct Amerindian language of Chile, Tuyuca marks for evidentiality, that is, how it is that you know something. For instance:

Diga ape-wi. = The boy played soccer. (I saw him playing).
Diga ape-hiyi.
= The boy played soccer. (I assume he was playing soccer, though I did not see it firsthand).

Evidential marking is obligatory on all Tuyuca verbs and it forces you to think about how you know whatever it is you know.

Tuyuca definitely gets a 6 rating!

Central Tucanoan

Cubeo, a language spoken in the Vaupes of Colombia, has a small closed class of adjective roots similar to Juǀʼhoan below:


However, verbs can function as adjectives, and the adjective roots can either turn into nouns themselves or they can take the inflections of either nouns or verbs. Wild!

Similar to how the grammar of Tariana has been influenced by Tucano languages, the grammar of Tucanoan Cubeo has been influenced by neighboring Arawakan languages. The grammar has been described as either SOV or OVS. That would mean that the following:

The man the ball hit.
The ball hit the man.

Mean the same things. OVS languages are quite rare.

Morphemes belong to one of four classes:

  1. Nasal (many roots, as well as suffixes like -xã  = associative)
  2. Oral (many roots, as well as suffixes like -pe  = similarity, -du = frustrative)
  3. Unmarked (only suffixes, e.g. -re  = in/direct object)
  4. Oral/Nasal (some roots and some suffixes) /bãˈkaxa-/(mãˈkaxa-) – to defecate and -kebã = suppose

Just by looking at any given consonant-initial suffix, it is impossible to determine which of the first three categories it belongs to. They must be learned one by one.

Cubeo has nasal assimilation, common to many Amazonian languages. In some of these, nasalization is best analyzed at the syllable level – some syllables are nasal and others are not.

She recently went.

The underlying form dĩ-bI-ko is realized on the surface as nĩmĩko. The ĩ in dĩ-bI-ko nasalizes the d, the b, and the I on either side of it, so nasal spreading works in both directions. However, it is blocked from the third syllable because k is part of a class of non-nasalizable consonants.

Pretty difficult language.

Cuneo gets a 6 rating, hardest of all.


Hixkaryána is famous for being the only language on Earth to have basic OVS (Object-Verb-Subject) word order.

The sentence Toto yonoye kamara, or The man ate the jaguar, actually means The jaguar ate the man.

Toto yonoye kamara
Lit. The man ate the jaguar.
Gloss: The jaguar ate the man.

Grammatical suffixes attached to the end of the verb mark not only number but also aspect, mood and tense.

Hixkaryána gets a 6 rating, hardest of all.


This is actually a series of closely related languages as opposed to one language, but the Southern Nambikwara language is the most well-known of the family, with 1,200 speakers in the Brazilian Amazon.

Phonology is complex. Consonants distinguish between aspirated, plain and glottalized, common in the Americas. There are strange sounds like prestopped nasals glottalized fricatives. There are nasal vowels and three different tones. All vowels except one have both nasal, creaky-voiced and nasal-creaky counterparts, for a total of 19 vowels.

The grammar is polysynthetic with a complex evidential system.

Reportedly, Nambikwara children do not pick up the language fully until age 10 or so, one of the latest recorded ages for full competence. Nambikwara is sometimes said to be the hardest language on Earth to learn, but it has some competition.

Nambikwara definitely gets a 6 rating, hardest of all!


Pirahã is a language isolate spoken in the Brazilian Amazon. Recent writings by Daniel Everett indicate that not only is this one of the hardest languages on Earth to learn, but it is also one of the weirdest languages on Earth. It is monumentally complex in nearly every way imaginable. It is commonly listed on the rogue’s gallery of craziest languages and phonologies on Earth.

It has the smallest phonemic inventory on Earth with only seven consonants, three vowels and either two or three tones. Everett recently wrote a paper about it after spending many years with them. Previous missionaries who had spent time with the Pirahã generally failed to learn the language because it was too hard to learn. It took Everett a very long time, but he finally learned it well.

Many of Everett’s claims about Pirahã are astounding: whistled speech, no system for counting, very few Portuguese loans (they deliberately refuse to use Portuguese loans) evidence for the Sapir-Whorf linguistic relativity hypothesis, and evidence that it violates some of Noam Chomsky’s purported language universals such as embedding. It also has the t͡ʙ̥ sound – a bilabially trilled postdental affricate which is only found in two other languages, both in the Brazilian Amazon – Oro Win and Wari’.

Initially, Everett never heard the sound, but they got to know him better, they started to make it more often. Everett believes that they were ridiculed by other groups when they made the odd sound.

Pirahã has the simplest kinship system in any language – there is only word for both mother and father, and the Pirahã do not have any words for anyone other than direct biological relatives.

Pirahã may have only two numerals, or it may lack a numeral system altogether.

Pirahã does not distinguish between singular and plural person. This is highly unusual. The language may have borrowed its entire pronoun set from the Tupian languages Nheengatu and Tenarim, groups the Pirahã had formerly been in contact with. This may be one of the only attested case of the borrowing of a complete pronoun set.

There are mandatory evidentiality markers that must be used in Pirahã discourse. Speakers must say how they know something, whether they saw it themselves, whether it was hearsay or whether they inferred it circumstantially.

There are various strange moods – the desiderative (desire to perform an action) and two types of frustrative – frustration in starting an action (inchoative/incompletive) and frustration in completing an action (causative/incompletive). There are others: immediate/intentive (you are going to do something now/you intend to do it in the future)

There are many verbal aspects: perfect/imperfect (completed/incomplete) telic/atelic (reaching a goal/not reaching a goal), continuative (continuing), repetitive (iterative), and beginning an action (inchoative).

Each Pirahã verb has 262,144 possible forms, or possibly in the many millions, depending on which analysis you use.

The future tense is divided into future/somewhere and future/elsewhere. The past tense is divided into plain past and immediate past.

Pirahã has a closed class of only 90 verb roots, an incredibly small number. But these roots can be combined together to form compound verbs, a much larger category. Here is one example of three verbs strung together to form a compound verb:

xig ab op
take turn go
bring back, You take something away, you turn around, and you go back to where you got it to return it.

There are no abstract color terms in Pirahã. There are only two words for colors, one for light and one for dark. The only other languages with this restricted of a color sense are in Papua New Guinea. The other color terms are not really color terms, but are more descriptive – red is translated as like blood.

Pirahã can be whistled, hummed or encoded into music. Consonants and vowels can be omitted altogether and meaning conveyed instead via variations in stress, pitch and rhythm. Mothers teach the language to children by repeating musical patterns.

Pirahã may well be one of the hardest languages on Earth to learn.

Pirahã gets a 6 rating, hardest of all.


Quechua (actually a large group of languages and not a single language at all) is one of the easiest Amerindian languages to learn. Quechua is a classic example of a highly regular grammar with few exceptions. Its agglutinative system is more straightforward than even that of Turkish. The phonology is dead simple.

On the down side, there is a lot of dialectal divergence (these are actually separate languages and not dialects) and a lack of learning materials. Some say that Quechua speakers spend their whole lives learning the language.

Quechua has inconsistent orthographies. There is a fight between those who prefer a Spanish-based orthography and those who prefer a more phonemic one. Also there is an argument over whether to use the Ayacucho language or the Cuzco language as a base.

Quechua has a difficult feature known as evidential marking. This marker indicates the source of the speaker’s knowledge and how sure they are about the statement.

-mi expresses personal knowledge:

Tayta Wayllaqawaqa chufirmi.
Mr. Huayllacahua is a driver. (I know it for a fact.)

-si expresses hearsay knowledge:

Tayta Wayllaqawaqa chufirsi.
Mr. Huayllacahua is a driver (or so I’ve heard).

chá expresses strong possibility:

Tayta Wayllaqawaqa chufirchá.
Mr. Huayllacahua is a driver (most likely).

Quechua is rated 4, very difficult.


Aymara has some of the wildest morphophonology out there. Morpheme-final vowel deletion is present in the language as a morphophonological process, and it is dependent on a set of highly complex phonological, morphological and syntactic rules (Kim 2013).

For instance, there are three types of suffixes: dominant, recessive and a 3rd class is neither dominant nor recessive. If a stem ends in a vowel, dominant suffixes delete the vowel but recessive suffixes allow the vowel to remain. The third class either deletes or retains the vowel on the stem depending on how many vowels are in the stem. If the root has two vowels, the vowel is retained. If it has three vowels, the vowel is deleted.

Although all of this seems quite odd, Finnish has something similar going on, if not a lot worse.

Nevertheless, Aymara is still said to be a very easy language to learn. The Guinness Book of World Records claims it is almost as easy to learn as Esperanto.

Aymara gets a 2 rating, very easy to learn.


Australian Aborigine languages are some of the hardest languages on Earth to learn, like Amerindian or Caucasian languages. Some Australian languages have phonemic contrasts that few other languages have, such as apico-dental, lamino-dental, apico-post-alveolar, and lamino-postalveolar cononals.

Australian languages tend to be mixed ergative. Ordinary nouns are ergative-absolutive, but 1st and 2nd person pronouns are nominative-accusative. One language has a three way agent-patient-experiencer distinction in the 1st person pronoun. Australian pronouns typically have singular, plural and dual forms along with inclusive and exclusive 1st plural. In some sentences, they have what is known as double case agreement which is rare in the world’s languages:

I gave a spear to my father.
I gave a spear mine-to father’s-to.

Both elements of the phrase my father are in both dative and genitive.

However, Aboriginal languages do have the plus of being very regular.

All Australian languages are rated 6, most difficult of all.


Berik is a Tor-Orya language spoken in Indonesian colony of Irian Jaya in New Guinea.

Verbs take many strange endings, in many cases mandatory ones, that indicate what time of day something happened, among other things.

TelbenerHe drinks in the evening.

Where a verb takes an object, it will not only be marked for time of day but for the size of the object.

KitobanaHe gives three large objects to a man in the sunlight.

Verbs may also be marked for where the action takes place in reference to the speaker.

GwerantenaTo place a large object in a low place nearby.

Berik is rated 6, hardest of all.

Trans New Guinea

Amele is the world’s most complex language as far as verb forms go, with 69,000 finitive and 860 infinitive forms.

Amele is rated 6, hardest of all.


Valman is a bizarre case where the word and that connects two nouns is actually a verb of all things and is marked with the first noun as subject and the second noun as object.

John (subject) and Mary (object)

John is marked as subject for some reason, and Mary is marked as object, and the and word shows subject agreement with John and object agreement with Mary.

Valman gets a 6 rating, hardest of all.


Semitic languages such as Arabic and Hebrew are notoriously difficult to learn, and Arabic (especially MSA) tops many language learners’ lists as the hardest language they have ever attempted to learn. Although Semitic verbs are notoriously complex, the verbal system does have some advantages especially as compared to IE languages like Slavic. Unlike Slavic, Semitic verbs are not inflected for mood and there is no perfect or imperfect.


Arabic has some very irregular manners of noun declension, even in the plural. For instance, the word girls changes in an unpredictable way when you say one girl, two girls and three girls, and there are two different ways to say two girls depending on context. Two girls is marked with the dual, but different dual forms can be used. All languages with duals are relatively difficult for most speakers that lack a dual in their native language. However, the dual is predictable from the singular, so one might argue that you only need to learn how to say one girl and three girls.

Further, it is full of irregular plurals similar to octopus and octopi in English, whereas these forms are rare in English. With any given word, there might be 20 different possible ways to pluralize it, and there is no way to know which of the 20 paradigms to use with that word, and further, there is no way to generalize a plural pattern from a singular pattern. In addition, many words have 2-3 ways of pluralizing them. Some messy Arab plurals:

kalb -> kilaab
-> quluub
-> makaatib
-> tullaab
-> buldaan

When you say I love you to a man, you say it one way, and when you say it to a woman, you say it another way. On and on.

The Arabic writing system is exceeding difficult and is more of the hardest to use of any on Earth. Soft vowels are omitted. You have to learn where to insert missing vowels, where to double consonants and which vowels to skip in the script. There are 28 different symbols in the alphabet and four different ways to write each symbol depending on its place in the word.

Consonants are written in different ways depending on where they appear in a word. An h is written differently at the beginning of a word than at the end of a word. However, one simple aspect of it is that the medial form is always the same as the initial form. You need to learn not only Arabic words but also the grammar to read Arabic.

Pronouns attach themselves to roots, and there are many different verb conjugation paradigms which simply have to be memorized. For instance, if a verb has a و, a ي, or a ء  in its root, you need to memorize the patters of the derivations, and that is a good chunk of the conjugations right there. The system for measuring quantities is extremely confusing.

The grammar has many odd rules that seem senseless. Unfortunately, most rules have exceptions, and it seems that the exceptions are more common than the rules themselves. Many people, including native speakers, complain about Arabic grammar.

Arabic does have case, but the system is rather simple.

The laryngeals, uvulars and glottalized sounds are hard for many foreigners to make and nearly impossible for them to get right. The ha’(ح ), qa (ق ) and غ sounds and the glottal stop in initial position give a lot of learners headaches.

Arabic is at least as idiomatic as French or English, so it order to speak it right you have to learn all of the expressionistic nuances.

One of the worst problems with Arabic is the dialects, which in many cases are separate languages altogether. If you learn Arabic, you often have to learn one of the dialects along with classical Arabic. All Arabic speakers speak both an Arabic dialect and Classical Arabic.

In some Arabic as a foreign language classes, even after 1 1/2 years, not one student could yet make a complete and proper sentence that was not memorized.

Adding weight to the commonly held belief that Arabic is hard to learn is research done in Germany in 2005 which showed that Turkish children learn their language at age 2-3, German children at age 4-5, but Arabic kids did not get Arabic until age 12.

Arabic has complex verbal agreement with the subject, masculine and feminine gender in nouns and adjectives, head-initial syntax and a serious restriction to forming compounds. If you come from a language that has similar nature, Arabic may be easier for you than it is for so many others. Its 3 vowel system makes for easy vowels.

MSA Arabic is rated 5, extremely difficult.

Arabic dialects are often somewhat easier to learn than MSA Arabic. At least in Lebanese and Egyptian Arabic, the very difficult q’ sound has been turned into a hamza or glottal stop which is an easier sound to make. Compared to MSA Arabic, the dialectal words tend to be shorter and easier to pronounce.

To attain anywhere near native speaker competency in Egyptian Arabic, you probably need to live in Egypt for 10 years, but Arabic speakers say that few if any second language learners ever come close to native competency. There is a huge vocabulary, and most words have a wealth of possible meanings.

Egyptian Arabic is rated 4.5, very to extremely difficult.

Moroccan Arabic is said to be particularly difficult, with much vowel elision in triconsonantal stems. In addition, all dialectal Arabic is plagued by irrational writing systems.

Moroccan Arabic is rated 4.5, very to extremely difficult.

Maltese is a strange language, basically a Maghrebi Arabic language (similar to Moroccan or Tunisian Arabic) that has very heavy influence from non-Arabic tongues. It shares the problem of Gaelic that often words look one way and are pronounced another.

It has the common Semitic problem of difficult plurals. Although many plurals use common plural endings (-i, -iet, -ijiet, -at), others simply form the plural by having their last vowel dropped or adding an s (English borrowing). There’s no pattern, and you simply have to memorize which ones act which way. Maltese permits the consonant cluster spt, which is surely hard to pronounce.

On the other hand, Maltese has quite a few IE loans from Italian, Sicilian, Spanish, French and increasingly English. If you have knowledge of Romance languages, Maltese is going to be easier than most Arabic dialects.

Maltese is rated 4, very difficult.


Hebrew is hard to learn according to a number of Israelis. Part of the problem may be the abjad writing system, which often leaves out vowels which must simply be remembered. Also, other than borrowings, the vocabulary is Afroasiatic, hence mostly unknown to speakers of IE languages. There are also difficult consonants as in Arabic such as pharyngeals and uvulars.

The het or glottal h is particularly hard to make. However, most modern Israelis no longer make the het sound or a’ain sounds. Instead, they pronounce the het like the chaf sound and the a’ain like an alef. Almost all Ashkenazi Israeli Jews no longer use the het or a’ain sounds. But most Jews who came from Arab countries (often older people) still use the sound, and some of their children do (Dorani 2013).

Hebrew has complex morphophonological rules. The letters p, b, t, d, k and g change to v, f, dh, th, kh and gh in certain situations. In some environments, pharyngeals change the nature of the vowels around them. The prefix ve-, which means and, is pronounced differently when it precedes certain letters. Hebrew is also quite irregular.

Hebrew has quite a few voices, including active, passive, intensive, intensive passive, etc. It also has a number of tenses such as present, past and the odd juissive.

Hebrew also has two different noun classes. There are also many suffixes and quite a few prefixes that can be attached to verbs and nouns.

Even most native Hebrew speakers do not speak Hebrew correctly by a long shot.

Quite a few say Hebrew is as hard to learn as MSA or perhaps even harder, but this is controversial.

Hebrew gets a 5 rating for extremely difficult.


Berber languages are considered to be very hard to learn. Worse, there are very few language learning resources available.

Tamazight allows doubled consonants at the beginning of a word! How can you possibly make that sound?

Tamazight gets a 6 rating, hardest of all.

In Tachelhit , words like this are possible:

You took it off.

You gave it.

In addition, there are words which contain only one or two consonants:


feed on

Tachelhit gets a 6 rating, hardest of all.


Amharic is said to be a very hard language to learn. It is quite complex, and its sentence structures seem strange even to speakers of other Semitic languages. Hebrew speakers say they have a hard time with this language.

There are a multitude of rules which almost seem ridiculous in their complexity, there are numerous conjugation patterns, objects are suffixed to the verb, the alphabet has 274 letters, and the pronunciation seems strange. However, if you already know Hebrew or Arabic, it will be a lot easier. The hardest part of all is the verbal system, as with any Semitic language. It is easier than Arabic.

Amharic gets a 4.5 rating, very hard to extremely hard.

East Cushitic

Dahalo is legendary for having some of the wildest consonant phonology on Earth. It has all four airstream mechanisms found in languages: ejectives, implosives, clicks and normal pulmonic sounds. There are both glottal and epiglottal stops and fricatives and laminal and apical stops.

There is also a strange series of nasal clicks and are both glottalized and plain. Some of these clicks are also labialized. It has both voiced and unvoiced prenasalized stops and affricates, and some of the stops are also labialized. There is a weird palatal lateral ejective. There are three different lateral fricatives, including a labialized and palatalized one, and one lateral approximant. It contrasts alveolar and palatal lateral affricates and fricatives, the only language on Earth to do this.

The Dahalo are former elephant hunting hunter gatherers who live in southern Kenya. It is believed that at one time they spoke a language like Sandawe or Hadza, but they switched over to Cushitic at some point. The clicks are thought to be substratum from a time when Dahalo was a Sandawe-Hadza type language.

Dahalo gets a 6 rating, hardest of all.


Somali has one of the strangest proposition systems on Earth. It actually has no real prepositions at all. Instead it has preverbal particles and possessives that serve as prepositions.

Here is how possessives serve as prepositions:

habeennimada horteeda
the night her front
before nightfall

kulaylka dartiisa
the heat his reason
because of the heat

Here we have the use of a preverbal particle serving as a preposition:

kú ríd shandádda
Into put the suitcase.
Put it into the suitcase.

Somali combines four “prepositions” with four deictic particles to form its prepositions.

There are four basic “prepositions”:


These combine with a four different deictic particles:

toward the speaker
away from the speaker
toward each other
away from each other

Hence you put the “prepositions” and the deictic particles together in various ways. Both tend to go in front of and close to the verb:

Nínkíi bàan cèelka xádhig kagá sóo saaray.
…well-the rope with-from towards-me I-raised.
I pulled the man out of the well with a rope.

Way inoogá warrámi jireen.
They us-to-about news gave.
They used to give us news about it.

Prepositions are the hardest part of the Somali language for the learner.

Somali deals with verbs of motion via deixis in a similar way that Georgian does. One reference point is the speaker and the other is any other entities discussed. Verbs of motion are formed using adverbs. Entities may move:

towards each other    wada
away from each other  kala
towards the speaker   so
away from the speaker si


kala durka separate
si gal     go in (away from the speaker)
so gal     come in (toward the speaker)

Somali lacks orthographic consistency. There are four different orthographic systems in use – the Wadaad Arabic script, the Osmanya Ethiopic script, the Borama script and the Latin Somali alphabet, the current system.

All of the difficult sounds of Arabic are also present in Somali, another Semitic language – the alef, the ha, the qaf and the kha. There are long and short vowels.  There is a retroflex d, the same sound found in South Indian languages. Somali also has 2 tones – high and low. For some reason, Somali tends to make it onto craziest phonologies lists.

Somali pluralization makes no sense and must be memorized. There are seven different plurals, and there is no clue in the singular that tells you what form to use in the plural. See here:


áf  (language) -> afaf


hoóyo (mother) -> hoyoóyin

áabbe -> aabayaal

Note the tone shifts in all three of the plurals above.

There are four cases, absolutive, nominative, genitive and vocative. Despite the presences of absolutive and nominative cases, Somali is not an ergative language. Absolutive case is the basic case of the noun, and nominative is the case given to the noun when a verb follows in the sentence. There are different articles depending on whether the noun was mentioned previously or not (similar to the articles a and the in English). The absolutive and nominative are marked not only on the noun but also on the article that precedes it.

In terms of difficulty, Somali is much harder than Persian and probably about as difficult as Arabic.

Somali gets a 5 rating, extremely hard to learn.


Malayalam, a Dravidian language of India, was has been cited as the hardest language to learn by an language foundation, but the citation is obscure and hard to verify.

Malayalam words are often even hard to look up in a Malayalam dictionary.

For instance, adiyAnkaLAkkikkoNDirikkukayumANello is a word in Malayalam. It means something like I, your servant, am sitting and mixing s.t. (which is why I cannot do what you are asking of me). The part in parentheses is an example of the type of sentence where it might be used.

The above word is composed of many different morphemes, including conjunctions and other affixes, with sandhi going on with some of them so they are eroded away from their basic forms. There doesn’t seem to be any way to look that word up or to write a Malayalam dictionary that lists all the possible forms, including forms like the word above. It would probably be way too huge of a book. However, all agglutinative languages are made up of affixes, and if you know the affixes, it is not particularly hard to parse the word apart.

Malayalam is said to be very hard to pronounce correctly.

Further, few foreigners even try to learn Malayalam, so Malayalam speakers, like the French, might not listen to you and might make fun of you if your Malayalam is not native sounding.

However, Malayalam has the advantage of having many pedagogic materials available for language learning such as audio-visual material and subtitled videos.

Malayalam is rated 5, extremely difficult.


Tamil, a Dravidian language is hard, but probably not as difficult as Malayalam is. Tamil has an incredible 247 characters in its alphabet. Nevertheless, most of those are consonant-vowel combinations, so it is almost more of a syllabary than an alphabet. Going by what would traditionally be considered alphabetic symbols, there are probably only 72 real symbols in the alphabet. Nevertheless, Tamil probably has one of the easier Indic scripts as Tamil has fewer characters than other scripts due to its lack of aspiration. Compare to Devanagari’s over 1,000 characters.

But no Indic script is easy. A problem with Tamil is that all of the characters seem to look alike. It is even worse than Devanagari in that regard. However, the more rounded scripts such as Kannada, Sinhala, Telegu and Malayalam have that problem to a worse degree. Tamil has a few sharp corners in the characters that helps to disambiguate them.

In addition, as with other languages, words are written one way and pronounced another. However, there are claims that the difficulty of Tamil’s diglossia is overrated.

Tamil has two different registers for written and spoken speech, but the differences are not large, so this problem is exaggerated. Both Tamil and Malayalam are spoken very fast and have extremely complicated, nearly impenetrable scripts. If Westerners try to speak a Dravidian language in south India, more often than not the Dravidian speaker will simply address them in English rather than try to accommodate them.

Tamil has the odd evidential mood, similar to Bulgarian.

However, on the plus side, the language does seem to be very logical and regular, almost like German in that regard. In addition, there are a lot of language learning materials for Tamil.

Tamil is rated 4, very difficult.


Most agree that Korean is a hard language to learn.

The alphabet, Hangul at least is reasonable; in fact, it is quite elegant. But there are four different Romanizations- Lukoff, Yale, Horne, and McCune-Reischauer – which is preposterous. It’s best to just blow off the Romanizations and dive straight into Hangul. This way you can learn a Romanization later, and you won’t mess up your Hangul with spelling errors, as can occur if you go from Romanization to Hangul.

Hangul can be learned very quickly, but learning to read Korean books and newspapers fast is another matter altogether because you really need to know the hanja or Chinese character that are used in addition to the Hangul. After World War 2, the Koreas decided to officially get rid of their Chinese characters, but in practice this was not successful. With the use of Chinese characters in Korean, you can be a lot more precise in terms what you are trying to communicate.

Bizarrely, there are two different numeral sets used, but one is derived from Chinese so it should be familiar to Chinese, Japanese or Thai speakers who use similar or identical systems.

Korean has a wealth of homonyms, and this is one of the tricky aspects of the language. Any given combination of a couple of characters can have multiple meanings. Japanese has a similar problem with homonyms, but at least with Japanese you have the benefit of kanji to help you tell the homonyms apart. With Korean Hangul, you get no such advantage.

Similarly, there seem to be many ways to say the same thing in Korean. The learner will feel when people are using all of these different ways of saying the same thing that they are actually saying something different each time, but that is not the case.

One problem is that the bp, j, ch, t and d are pronounced differently than their English counterparts. The consonants, the pachim system and the morphing consonants at the end of the word that slide into the next word make Korean harder to pronounce than any major European language. Korean has a similar problem with Japanese, that is, if you mess up one vowel in sentence, you render it incomprehensible.

The vocabulary is very difficult for an English speaker who does not have knowledge of either Japanese or Chinese. On the other hand, Japanese or Chinese will help you a lot with Korean.

Korean is agglutinative and has a subject-topic discourse structure, and the logic of these systems is difficult for English speakers to understand. In addition, there are hundreds of ways of conjugating any given verb based on tense, mood, age or seniority. Adjectives also decline and take hundreds of different suffixes.

Meanwhile, Korean has an honorific system that is even wackier than that of Japanese. A single sentence can be said in three different ways depending on the relationship between the speaker and the listener. However, the younger generation is not using the honorifics so much, and a foreigner isn’t expected to know the honorific system anyway.

Maybe 60% of the words are based on Chinese words, but unfortunately, much of this Chinese-based vocabulary intersects with Japanese versions of Chinese words in a confusing way.

Speakers of Korean can learn Japanese fairly easily. Korean seems to be a more difficult language to learn than Japanese. There are maybe twice as many particles as in Japanese, the grammar is dramatically more difficult and the verbs are quite a bit harder. The phonemic inventory in Korean is also larger and includes such oddities as double consonants.

Korean is rated by language professors as being one of the hardest languages to learn.

Korean is rated 5, extremely hard.


Japanese also uses a symbolic alphabet, but the symbols themselves are sometime undecipherable in that even Japanese speakers will sometimes encounter written Japanese and will say that they don’t know how to pronounce it. I don’t mean that they mispronounce it; that would make sense. I mean they don’t have the slightest clue how to say the word! This problem is essentially nonexistent in a language like English.

The Japanese orthography is one of the most difficult to use of any orthography.

There are over 2,000 frequently used characters in three different symbolic alphabets that are frequently mixed together in confusing ways. Due to the large number of frequently used symbols, it’s said that even Japanese adults learn a new symbol a day a ways into adulthood.

The Japanese writing system is probably crazier than the Chinese writing system and it often makes it onto lists of worst orthographies. The very idea of writing an agglutinative language in a combination of two syllabaries and an ideography seems wacky right off the bat. Japanese borrowed Chinese characters. But then they gave each character several pronunciations, and in some cases as many as 24. Next they made two syllabaries using another set of characters, then over the next millennia came up with all sorts of contradictory and often senseless rules about when to use the syllabaries and when to use the character set. Later on they added a Romanization to make things even worse.

Chinese uses 5-6,000 characters regularly, while Japanese only uses around 2,000. But in Chinese, each character has only one or maybe two pronunciations. In Japanese, there are complicated rules about when and how to combine the hiragana with the characters. These rules are so hard that many native speakers still have problems with them. There are also personal and place names (proper nouns) which are given completely arbitrary pronunciations often totally at odds with the usual pronunciation of the character.

There are some writers, typically of literature, who deliberately choose to use kanji that even Japanese people cannot read. For instance, Ryuu  Murakami  uses the odd symbols 擽る、, 轢く、and 憑ける.

The Japanese system is made up of three different systems: the katakana and hiragana (the kana) and the kanji, similar to the hanzi used in Chinese. Chinese has at least 85,000 hanzi. The number of kanji is much less than that, but kanji often have more than one meaning in contrast to hanzi.

After WW2, Japan decided to simplify its language. They both simplified and reduced the number of Chinese characters used, and they unified the written and spoken language, which previously had been different.

Speaking Japanese is not as difficult as everyone says, and many say it’s fairly easy. However, there is a problem similar to English in that one word can be pronounced in multiple ways, like read and read in English.

A common problem is that a perfectly grammatically correct sentence uttered by a Japanese language learner, while perfectly correct, is still not acceptable by Japanese speakers because “we just don’t say it that way.” The Japanese speaker often cannot tell why the unacceptable sentence you uttered is not ok. On the other hand, this problem may be common to more languages than Japanese.

There is also a class of Japanese called “honorifics” or “keigo” that is quite hard to master. Honorifics are meant to show respect and to indicate one’s place or status in the social hierarchy. These typically effect verbs but can also affect particles and prefixes. They are usually formed by archaic or highly irregular verbs. However, there are both regular and irregular honorific forms. Furthermore, there are five different levels of honorifics. Honorifics vary depending on who you are and who you are talking to. In addition, gender comes into play.

Although it is true the Japanese young people are said to not understand the intricacies of keigo, it is still expected that they know how to speak this well. Consequently, many young Japanese will opt out of certain conversations because they feel that their keigo is not very good. Books explaining how to use keigo properly have been big sellers among young people in Japan in recent years as young people try to appear classy, refined or cultured.

In addition, Japanese born overseas (especially in the US), while often learning Japanese pretty well, typically have a very poor understanding of keigo. Instead of embarrassing themselves by not using keigo or using it wrong, these Japanese speakers often prefer to speak in English to Japanese people rather than bother with keigo-less Japanese. Overcorrection in keigo is also a problem when hypercorrection leads to someone making errors in keigo due to “trying to hard.” This looks like phony or insincere politeness and is often worse than not using keigo at all.

One wild thing about Japanese is counting forms. You actually use different numeral sets depending on what it is you are counting! There are dozens of different ways of counting things which involve the use of a complex numerical noun classifier system.

Japanese grammar is often said to be simple, but that does not appear to be the case on closer examination. Particles are especially vexing. Verbs engage in all sorts of wild behavior, and adverbs often act like verbs. Nouns can act like adjectives and adverbs. Meanwhile, honorifics change the behavior of all words. There are particles like ha and ga that have many different meanings. One problem is that all noun modifiers, even phrases, must precede the nouns they are modifying.

It’s often said that Japanese has no case, but this is not true. Actually, there are seven cases in Japanese. The aforementioned ga is a clitic meaning nominative, made is terminative case, -no is genitive and -o is accusative.

In this sentence:

The plane that was supposed to arrive at midnight, but which had been delayed by bad weather, finally arrived at 1 AM.

Everything underlined must precede the noun plane:

Was supposed to arrive at midnight, but had been delayed by bad weather, the plane finally arrived at 1 AM.

One of the main problems with Japanese grammar is that it is going to seem to so different from the sort of grammar and English speaker is likely to be used to.

Speaking Japanese is one thing, but reading and writing it is a whole new ballgame. It’s perfectly possible to know the meaning of every kanji and the meaning of every word in a sentence, but you still can’t figure out the meaning of the sentence because you can’t figure out how the sentence is stuck together in such a way as to create meaning.

The real problem is that the Japanese you learn in class is one thing, and the Japanese of the street is another. One problem is that in street Japanese, the subject is typically not stated in a sentence. Instead it is inferred through such things as honorific terms or the choice of words you used in the sentence. Probably no one goes crazier on negatives than the Japanese. Particularly in academic writing, triple and quadruple negatives are common, and can be quite confusing.

Yet there are problems with the agglutinative nature of Japanese. It’s a completely different syntactic structure than English. Often if you translate a sentence from Japanese to English it will just look like a meaningless jumble of words.

However, Japanese grammar has the advantage of being quite regular. For instance, there are only four frequently used irregular verbs.

Like Chinese, the nouns are not marked for number or gender. However, while Chinese is forgiving of errors, if you mess up one vowel in a Japanese sentence, you may end up with incomprehension.

Although many Japanese learners feel it’s fairly easy to learn, surveys of language professors continue to rate Japanese as one of the hardest languages to learn. A study by the US Navy concluded that the hardest language the corpsmen had to learn in the course of service was Japanese. However, it’s generally agreed that Japanese is easier to learn than Korean. Japanese speakers are able to learn Korean pretty easily.

Japanese is rated 5, extremely hard.

Classical Japanese is much harder to read than Modern Japanese. Though you can get by with much less kanji when reading the modern language, you will need a minimum knowledge of 3,000 kanji for reading Classical Japanese, and that’s using a dictionary. There are only about 500-1,000 frequently used characters, but there are countless other words that will come up in your reading especially say special words used in the Imperial Court. Many words have more than one meaning, and unless you know this, you will be lost. 東宮(とうぐう) for instance means Eastern Palace. However, it also means Crown Prince because his residence was to the east of the Emperor’s.

The movie The Seven Samurai (set in the late 1500’s) seems to use some sort of Classical Japanese, or at least Classical vocabulary and syntax with modern pronunciation. Japanese language learners say they can’t understand a word of the archaic Japanese used in this movie.

Classical Japanese gets 5.5, nearly hardest of all.

Western Oghuz

Turkish is often considered to be hard to learn, and it’s rated one of the hardest in surveys of language teachers, however, it’s probably easier than its reputation made it out to be. It is agglutinative, so you can have one long word where in English you might have a sentence of shorter words. One word is

Were you one of those people whom we could not turn into a Czechoslovakian?

Many words have more than one meaning. However, the agglutination is very regular in that each particle of meaning has its own morpheme and falls into an exact place in the word. See here:

göz            eye
göz-lük        glasses
göz-lük-çü     optician
göz-lük-çü-lük the business of an optician

Nevertheless, agglutination means that you can always create new words or add new parts to words, and for this reason even a lot of Turkish adults have problems with their language.

There is no verb to be, which is hard for many foreigners. Instead, the concept is wrapped onto the subject of the sentence as a -dim or -im suffix. Turkish is an imagery-heavy language, and if you try to translate straight from a dictionary, it often won’t make sense.

However, the suffixation in Turkish, along with the vowel harmony, are both precise. Nevertheless, many words have irregular vowel harmony. The rules for making plurals are very regular, with no exceptions (the only exceptions are in foreign loans). In Turkish, incredible as it sounds, you can make a plural out of anything, even a word like what, who or blood. However, there is some irregularity in the strengthening of adjectives, and the forms are not predictable and must be memorized.

Turkish is a language of precision in other ways. For instance, there are eight different forms of subjunctive mood that describe various degrees of uncertainty that one has about what one is talking about. This relates to the evidentiality discussed under Tuyuca above, and Turkish has an evidential form similar to Tamil and Bulgarian. On Turkish news, verbs are generally marked with miş, which means that the announcer believes it to be true though he has not seen it firsthand. The particle miş is interesting because this evidential form is coded into the tense system, which is an unusual use of evidentiality.

The Roman alphabet and almost mathematically precise grammar really help out. Turkish lacks gender and has but a single irregular verb – olmak. Nevertheless, there are many verbal forms. However, this is controversial and it depends on how you define grammatical irregularity. There is some strangeness in some of the verb paradigms, but it is argued that these oddities are rule-based. The aorist tense is said to have irregularity.

There is some irregular morphophonology, but not much. The oblique relative clauses have complex morphosyntax. Turkish has two completely different ways of making relative clauses, one of which may have been borrowed from Persian. There are many gerunds for verbs, and these have many different uses. At the end of the day, Turkish grammar is not as regular or as simple as it is made out to be.

Words are pronounced nearly the same as they are written. A suggestion that Turkish may be easier to learn that many think is the research that shows that Turkish children learn attain basic grammatical mastery of Turkish at age 2-3, as compared to 4-5 for German and 12 for Arabic. The research was conducted in Germany in 2005.

In addition, Turkish has a phonetic orthography.

However, Turkish is hard for an English speaker to learn for a variety of reasons. It is agglutinative like Japanese, and all agglutinative languages are difficult for English speakers to learn. As in Japanese, you start your Turkish sentence the way you would end your English sentence. As in the Japanese example above, the subordinate clause must precede the subject, whereas in English, the subordinate clause must follow the subject. The italicized phrase below is a subordinate clause.

In English, we say, “I hope that he will be on time.”

In Turkish, the sentence would read, “That he will be on time I hope.”

Turkish vowels are unusual to speakers of IE languages, and Turkish learners say the vowels are hard to make or even tell apart from one another.

Turkish is rated 3.5, harder than average to learn.



One test of the difficulty of any language is how much of the grammar you must know in order to express yourself on a basic level. On this basis, Finno-Ugric languages are complicated because you need to know quite a bit more grammar to communicate on a basic level in them than in say, German.


Finnish is very hard to learn, and even long-time learners often still have problems with it. Famous polyglot Barry Farber said it was one of the hardest languages he learned. You have to know exactly which grammatical forms to use where in a sentence. In addition, Finnish has 15 cases in the singular and 16 in the plural. This is hard to learn for speakers coming from a language with little or no case.

For instance,
talothe house


talon        house's
taloasome    of the house
taloksiinto  as the house
talossain    the house
talostafrom  inside the house
talooninto   the house
talollaon    to the house
taloltafrom  beside the house
talolleto    the house
taloistafrom the houses
taloissa     in the houses

It gets much worse than that. This web page shows that the noun kauppashop can have 2,253 forms.

A simple adjective + noun type of noun phrase of two words can be conjugated in up to 100 different ways.

Adjectives and nouns belong to 20 different classes. The rules governing their case declension depend on what class the substantive is in.

As with Hungarian, words can be very long. For instance:

non-commissioned officer cadet learning to be an assistant mechanic for airplane jet engines

Like Turkish, Finnish agglutination is very regular. Each bit of information has its own morpheme and has an exact place in the word.

Like Turkish, Finnish has vowel harmony, but the vowel harmony is very regular like that of Turkish. Unlike Turkish or Hungarian, consonant gradation forms a major part of Finnish morphology. In order to form a sentence in Finnish, you will need to learn about verb types, cases and consonant gradation, and it can take a while to get your mind around those things.

Finnish, oddly enough, always puts the stress on the first syllable. Finnish vowels will be hard to pronounce for most foreigners.

However, Finnish has the advantage of being pronounced precisely as it is written. This is also part of the problem though, because if you don’t say it just right, the meaning changes. So, similarly with Polish, when you mangle their language, you will only achieve incomprehension. Whereas with say English, if a foreigner mangles the language, you can often winnow some sense out of it.

However, despite that fact that written Finnish can be easily pronounced, when learning Finnish, as in Korean, it is as if you must learn two different languages – the written language and the spoken language. A better way to put it is that there is “one language for writing and another for speaking.” You use different forms whether conversing or putting something on paper.

Some pronunciation is difficult. The the contrast between short and long vowels and consonants is particularly troublesome. Check out these minimal pairs:



A problem for the English speaker coming to Finnish would be the vocabulary, which is alien to the speaker of an IE language. Finnish language learners often find themselves looking up over half the words they encounter. Obviously, this slows down reading quite a bit!

In the grammar, the partitive case and potential tense can be difficult. Here is an example of how Finnish verb tenses combine with various cases to form words:

I A-Infinitive
Base form mennä

II E-Infinitive
Active inessive    mennessä
Active instructive mennen
Passive inessive   mentäessä

III MA-Infinitive
Inessive            menemässä
Elative             menemästä
Illative            menemään
Adessive            menemällä
Abessive            menemättä
Active instructive  menemän
Passive instructive mentämän

Verbs in Finnish

Finnish verbs are very regular. The irregular verbs can almost be counted on one hand:


and a few others. In fact, on the plus side, Finnish in general is very regular.

One easy aspect of Finnish is the way you can build many forms from a base root:


to write

As in many Asian languages, there are no masculine or feminine pronouns, and there is no grammatical gender. The numeral system is quite simple compared to other languages. Finnish has a complete lack of consonant clusters. In addition, the phonology is fairly simple.

Finnish is rated 5, extremely hard to learn.


Estonian has similar difficulties as Finnish, since they are closely related. However, Estonian is more irregular than Finnish. In particular, the very regular agglutination system described in Finnish seems to have gone awry in Estonian. Estonian has 14 cases, including strange cases such as the abessive, adessive, elative and inessive. On the other hand, all of these cases can simply be analyzed as the genitive case plus a single unvarying suffix for each case. In addition, there is no gender, so the only things you have to worry about when forming cases are singular and plural.

Estonian has a strange mood form called the quotative, often translated as “reported speech.”

tema onhe/she/it is

tema olevatit’s rumored that he/she/it is or he/she/it is said to be

This mood is often used in newspaper reporting and is also used for gossip.

Estonian has an astounding 25 diphthongs. It also has three different varieties of vowel length, which is strange in the world’s languages. There are short, vowels and extra-long vowels and consonants.

linalinen – short n
the town’s – long n, written as nn
into the town – extra-long n, not written out!

There are differences in the pronunciation of the three forms above, but in rapid speech, they are hard to hear, though native speakers can make them out. Difficulties are further compounded in that extra-long sonorants (m, n, ng, l, and r) and vowels and are not written out. All in all, phonemic length can be a problem in Estonian, and foreigners never seem to get it completely down.

Estonian pronunciation is not very difficult, though the õ sound can cause problems. However, Estonian has completely lost the vowel harmony system it inherited from Finnish, resulting in words that seem very hard to pronounce.

At least in written form, Estonian is not as complex as Finnish. Estonian can be seen as an abbreviated and modernized form of Finnish. The grammar is also like a simplified version of Finnish grammar and may be much easier to learn.

Estonian is rated 4.5, very to extremely difficult.


Skolt Sami‘s Latinization is often listed as one of the worst Latinizations around. The rest of the language is quite similar to, and as difficult as, Finnish.

Skolt Sami gets a 5 rating, extremely hard to learn.


It’s widely agreed that Hungarian is one of the hardest languages on Earth to learn. Even language professors agree. The British Diplomatic Corps did a study of the languages that its diplomats commonly had to learn and concluded that Hungarian was the hardest. Hungarian grammar is maddeningly complex, and Hungarian is often listed on craziest grammar lists. For one thing, there are many different forms for a single word via word modification. This enables the speaker to make his intended meaning very precise. Looking at nouns, there are about 257 different forms per noun.

Hungarian is said to have from 24-35 different cases (there are charts available showing 31 cases), but the actual number may only be 18. Nearly everything in Hungarian is inflected, similar to Lithuanian or Czech. Similar to Georgian and Basque, Hungarian has the polypersonal agreement, albeit to a lesser degree than those two languages. There are many irregularities in inflections, and even Hungarians have to learn how to spell all of these in school and have a hard time learning this.

The case distinctions alone can create many different words out of one base form. For the word house, we end up with 31 different words using case forms:

házbainto the house
in the house
from [within] the house
onto the house
on the house
off [from] the house
to the house
until/up to the house
at the house
[away] from the house
– Translative case, where the house is the end product of a transformation, such as They turned the cave into a house.
as the house, which could be used if you acted in your capacity as a house or disguised yourself as one. He dressed up as a house for Halloween.
for the house, specifically things done on its behalf or done to get the house. They spent a lot of time fixing things up (for the house).
– Essive-modal case. Something like “house-ly” or in the way/manner of a house. The tent served as a house (in a house-ly fashion).

And we do have some basic cases:

ház – Nominative. The house is down the street.
– Accusative. The ball hit the house.
– Dative. The man gave the house to Mary.
– Similar to instrumental, but more similar to English with. Refers to both instruments and companions.

The genitive takes 12 different declensions, depending on person and number:

házammy house
my houses
your house
your houses
his/her/its house
his/her/its houses
our house
our houses
your house
your house
their house
their houses
church, as in the Catholic Church. (Literally one-house)

In addition, the genitive suffixes to the possession, which is not how the genitive works in IE.


az ember házathe man’s house (Lit. the man house-his)
a házammy house (Lit. the house-my)
a házadyour house (Lit. the house-your)

There are also very long words such as this:

for your (you all possessive) repeated pretensions at being impossible to desecrate

Being an agglutinative language, that word is made up of many small parts of words, or morphemes. That word means something like

The preposition is stuck onto the word in this language, and this will seem strange to speakers of languages with free prepositions.

Hungarian is full of synonyms, similar to English.

For instance, there are 78 different words that mean to move: halad, jár, megy, dülöngél, lépdel, botorkál, kódorog, sétál , andalog, rohan, csörtet, üget, lohol, fut, átvág, vágtat, tipeg, libeg, biceg, poroszkál, vágtázik, somfordál , bóklászik, szedi a lábát, kitér, elszökken, betér , botladozik, őgyeleg, slattyog, bandukol, lófrál, szalad, vánszorog, kószál, kullog, baktat, koslat, kaptat, császkál, totyog, suhan, robog, rohan, kocog, cselleng, csatangol, beslisszol, elinal, elillan, bitangol, lopakodik, sompolyog, lapul, elkotródik, settenkedik, sündörög, eltérül, elódalog, kóborol, lézeng, ődöng, csavarog, lődörög, elvándorol , tekereg, kóvályog, ténfereg, özönlik, tódul, vonul, hömpölyög, ömlik, surran, oson, lépeget, mozog and mozgolódik .

Only about five of those terms are archaic and seldom used, the rest are in current use. However, to be a fair, a Hungarian native speaker might only recognize half of those words.

In addition, while most languages have names for countries that are pretty easy to figure out, in Hungarian even languages of nations are hard because they have changed the names so much. Italy becomes Olazorszag, Germany becomes Nemetzorsag, etc.

As in Russian and Serbo-Croatian, word order is relatively free in Hungarian. It is not completely free as some say but rather is it governed by a set of rules. The problem is that as you reorder the word order in a sentence, you say the same thing but the meaning changes slightly in terms of nuance. Further, there are quite a few dialects in Hungarian. Native speakers can pretty much understand them, but foreigners often have a lot of problems. Accent is very difficult in Hungarian due to the bewildering number of rules used to determine accent. In addition, there are exceptions to all of these rules. Nevertheless, Hungarian is probably more regular than Polish.

Hungarian spelling is also very strange for non-Hungarians, but at least the orthography is phonetic. Nevertheless, the orthography often makes it onto worst orthographies lists.

Hungarian phonetics is also strange. One of the problems with Hungarian phonetics is vowel harmony. Since you stick morphemes together to make a word, the vowels that you have used in the first part of the word will influence the vowels that you will use to make up the morphemes that occur later in the word. The vowel harmony gives Hungarian a “singing effect” when it is spoken. The ty, ny, sz, zs, dzs, dz, ly, cs and gy sounds are hard for many foreigners to make. The á, é, ó, ö, ő, ú, ü, ű, and í vowel sounds are not found in English.

Verbs are marked for object (indefinite, definite and person/number), subject (person and number) tense (past, present and future), mood (indicative, conditional and imperative), and aspect (frequency, potentiality, factitiveness, and reflexiveness.

I could make others save you occasionally (on a disk).

Verbs change depending on whether the object is definite or indefinite.

Olvasok könyvet.
I read a book.
(indefinite object)

Olvasom a könvyet.
I read the book.
(definite object)

As noted in the introduction to the Finno-Ugric section, you need to know quite a bit of Hungarian grammar to be able to express yourself on a basic level. For instance, in order to say:

I like your sister.

you will need to understand the following Hungarian forms:

  1. verb conjugation and definite or indefinite forms
  2. possessive suffixes
  3. case
  4. how to combine possessive suffixes with case
  5. word order
  6. explicit pronouns
  7. articles

It’s hard to say, but Hungarian is probably harder to learn than even the hardest Slavic languages like Czech, Serbo-Croatian and Polish. At any rate, it is generally agreed that Hungarian grammar is more complicated than Slavic grammar, which is pretty impressive as Slavic grammar is quite a beast.

Hungarian is rated 5, extremely hard to learn.


It’s fairly easy to learn to speak Mandarin at a basic level, though the tones can be tough. This is because the grammar is very simple – short words, no case, gender, verb inflections or tense. But with Japanese, you can keep learning, and with Chinese, you often tend to hit a wall, often because the syntactic structure is so strangely different from English (isolating).

Actually, the grammar is harder than it seems. At first it seems simple, like a simplified English. No word is capable of declension, and there is no tense, case, and number, nor are there articles. But the simplicity makes it difficult. No tense means there is no easy way to mark time in a sentence. Furthermore, tense is not as easy as it seems. Sure, there are no verb conjugations, but instead you must learn some particles and special word orders that are used to mark tense. Mandarin has 12 different adverbs for which there is no good English translation.

Once you start digging into Chinese, there is a complex layer under all the surface simplicity. There is such things as aspect, serial verbs, a complex classifier system, syntax marked by something called topic-prominence, a strange form called the detrimental passive, preposed relative clauses, use of verbs rather than adverbs to mark direction, and all sorts of strange stuff. Verb complements can be baffling, especially potential and directional complements. The 把, 是 and 的 constructions can be very hard to understand.

The topic-prominence is interesting in that only a few major languages have topic-comment syntax, and most of those are Oriental languages with a lot of Chinese borrowing. Topicalization is not marked morphologically.

There are sentences where the entire meaning changes with the addition of a single character. Chinese sentences are SVO (Subject -Verb – Object) at their base, but that is a bit of an illusion. A sentence that causes you to discuss time duration makes you repeat the verb after the direct object – SVOVT (T= time phrase). In the case of topicalization, sentences can have the structure of OSV (Object – Subject – Verb). Relative clauses and all subordinate clauses come before the noun they modify. In other words:

English: The man who always wore red walked into the room.
Chinese: Who always wore red the man walked into the room.

The relative clause in the sentences above is marked in bold.

In Chinese, the prepositional phrase comes between the subject and the verb:

English: The man hit the ball into the yard.
Chinese: The man into the yard hit the ball.

The prepositional phrase is bolded in the sentences above.

In Chinese, adjectives are actually stative verbs as in Nahuatl and Lakota.

Nàgè rède cài hěnhǎochī.
The it is hot food is good to eat.
The hot food is delicious.

The symbol turns food hot into food it is hot, an attributive verb. means something like to be.

There are dozens of words called particles which shade the meaning of a sentence ever so slightly.

Chinese phonology is not as easy as some say. There are way too many instances of the zh, ch, sh, j, q, and x sounds in the language such that many of the words seem to sound the same. There is a distinction between aspirated and nonaspirated consonants. There is also the presence of odd retroflex consonants.

Chinese orthography is probably the most hardest orthography of any language. The alphabet uses symbols, so it’s not even a real alphabet. There are at least 85,000 symbols and actually many more, but you only need to know about 3-5,000 of them, and many Chinese don’t even know 1,000. To be highly proficient in Chinese, you need to know 10,000 characters, and probably less than 5% of Chinese know that many.

In addition, the characters have not been changed in 3,000 years, and the alphabet is at least somewhat phonetic, so we run into a serious problem of lack of a spelling reform.

The Communists tried to simplify the system (simplified Mandarin) but instead of making the connections between the phonetic aspects of character more sensible by decreasing their number and increasing their regularity (they did do this somewhat but not enough), they simply decreased the number of strokes needed for each symbol typically without dealing with the phonetic aspect of all. The simplification did not work well, so now you have a mixture of two different types of written Chinese – simplified and traditional.

In addition to all of this, Chinese borrowed a lot from the Japanese symbolic alphabet a full 1,000 years after it had already been developed and had not undergone a spelling reform, adding insult to injury.

Even leaving the characters aside, the stylistic and literary constraints required to write Chinese in an eloquent or formal (literary) manner would make your head swim. And just because you can read Chinese does not mean that you can read Classical Chinese prose. It’s as if it’s written in a different language – actually, it is technically a different language similar to Middle English or Old English. However, few Middle English or Old English texts are read anymore, and Classical Chinese is still widely read.

However, the orthography is at least consistent. 90% of characters have only one reading. Once you learn the character, you generally know the meaning in any context.

Writing the characters is even harder than reading them. One wrong dot or wrong line either completely changes the meaning or turns the symbol into nonsense.

It’s a real problem when you encounter a symbol you don’t know because there is no way to sound out the word. You are really and truly lost and screwed. There is a clue at the right side of the symbol, but it is not always accurate.You need to learn quite a bit of vocabulary just to speak simple sentences.

Similarly, a dictionary is not necessarily helpful when trying to read Chinese. You can have a Chinese sentence in front of you along with a dictionary, and the sentence still might not make sense even after looking it up in the dictionary.

Some Chinese Muslims write Chinese using an Arabic script. This is often considered to be one of the worst orthographies of all.

The tones are often quite difficult for a Westerner to pick up. If you mess up the tones, you have said a completely different word. Often foreigners who know their tones well nevertheless do not say them correctly, and hence, they say one word when they mean another. However, compared to other tone systems around the world, the tonal system in Chinese is comparatively easy.

A major problem with Chinese is homonyms. To some extent, this is true in many tonal languages. Since Chinese uses short words and is disyllabic, there is a limited repertoire of sounds that can be used. At a certain point, all of the sounds are used up, and you are into the realm of homophones.

Tonal distinctions are one way that monosyllabic and disyllabic languages attempt to deal with the homophone problem, but it’s not good enough, since Chinese still has many homophones, and meaning is often discerned by context, stress, rhythm and intonation. Chinese, like French and English, is heavily idiomatic.

It’s little known, but Chinese also uses different forms (classifiers) to count different things, like Japanese.

There is zero common vocabulary between English and Chinese, so you need to learn a whole new set of lexical forms.

In addition, nouns often show relatedness or hierarchy. For instance, in English, you can simply say my brother or my sister, but in Chinese, you cannot do this. You have to indicate whether you are speaking of an older or younger sibling.

mei meiyounger sister
jie jie
older sister
ge ge
older brother
di di
younger brother

Mandarin scored very high on a weirdest languages study.

On the positive side, Chinese grammar is fairly regular and word derivation, compound words are sensible and the meaning can be determined by looking at the word. In other languages, compound words are not necessarily so obvious.

Many agree that Chinese is the hardest to learn of all of the major languages. A recent survey of language professors rated Chinese as the hardest language on Earth to learn.

Mandarin gets a 5.5 rating for nearly hardest of all.

However, Cantonese is even harder to learn than Mandarin. Cantonese has eight tones to Mandarin’s four, and in addition, they continue to use a lot of the older traditional Chinese characters that were superseded when China moved to a simplified script in 1949. Furthermore, since non-Mandarin characters are not standardized, Cantonese cannot be written down as it is spoken.

In addition, Cantonese has verbal aspect, possibly up to 20 different varieties. Modal particles are difficult in Cantonese. Clusters of up to the 3 sentence final particles are very common. 我食咗飯 and 我食咗飯架啦喎 are both grammatical for I have had a meal, but the particles add the meaning of I have already had a meal, answering a question or even to imply I have had a meal, so I don’t need to eat anymore.

Cantonese gets a 5.5 rating, nearly hardest of all.

Min Nan is also said to be harder to learn than Mandarin, as it has a more complex tone system, with five tones on three different levels. Even many Taiwanese natives don’t seem to get it right these days, as it is falling out of favor, and many fewer children are being raised speaking it than before.

Min Nan gets a 5.5 rating, nearly hardest of all.

A recent 15 year survey out of Fudan University utilizing both the departments of Linguistics and Anthropology looked at 579 different languages in 91 linguistic families in order to try to find the most complicated language in the world. The result was that a Wu language dialect (or perhaps a separate language) in the Fengxian district of southern Shanghai (Dônđän Wu) was the most phonologically complex language of all, with 20 separate vowels (Wang 2012). The nearest competitor was Norwegian with 16 vowels.

Dônđän Wu gets a 5.5 rating, nearly hardest of all.

Classical Chinese is still read by many Chinese people and Chinese language learners. Unless you have a very good grasp on modern Chinese, classical Chinese will be completely wasted on you. Classical Chinese is much harder to read than reading modern Chinese.

Classical Chinese covers an era extending over 3,000 years, and to attain a reading fluency in this language, you need to be familiar with all of the characters used during this period along with all of the literature of the period so you can understand all the allusions. Even with a knowledge of Classical Chinese, you need to read it in context. If you are good at Classical Chinese and someone throws you a random section of it, it will take you a good amount of time to figure it out unless you know context.

The language is much more to the point than Modern Chinese, but this is not as good as it sounds. This simplicity leaves a room for ambiguity, and context plays an important role. A joke about some obscure historical or literary anecdote will be lost you unless you know what it refers to. For reading modern Chinese, you will need at least 5,000 characters, but even then, you will still need a dictionary. With Classical Chinese, there are no lower limits on the number of characters you need to know. The sky is the limit.

Classical Chinese gets a 6 rating, hardest of all.


In Quiang, a language of Sichuan Province in China, not only are there rhotic vowels, which are present in only 1% of the world’s languages, but there is also rhoticity harmony, where a non-rhotic vowel in a morpheme becomes rhotic when it is followed by a morpheme with a rhotic vowel.

ʀuɑ +e˞ > ʀuɑ˞kʰ
+ w ˞> mw

Rhotic vowels are found in US English – Unstressed ɚ: standard, dinner, Lincolnshire, editor, measure, martyr.

Qiang also has a very bad romanization, so bad that the Qiang will not even use it. Voiced consonants are written by adding a vowel to the symbol for the voiceless consonant. It has long and short vowels, but these are not represented in the system.

Qiang gets a 5 rating, extremely hard to learn.

Western Tibeto-Burman
Central Bodish

Tibetan probably has one of the least rational orthographies of any language. The orthography has not changed in ~1,000 years while the language has gone through all sorts of changes. A langauge learner in Tibet can get by using phonetic spelling. The problem comes when you try to spell using the Classical Alphabet. For instance:

Srong rtsan Sgam po (written)
soŋtsɛn ɡampo (spoken)

bsgrubs (written)

d`up (spoken)

While the orthography is etymological and completely outdated, it is quite predictable.

Tibetan gets a 5 rating, extremely hard to learn.


Dzongka, the official language of Bhutan, has some pretty wild phonology, in addition to having the Tibetan writing system, this time using Bhutanese forms of the Tibetan script.

It contrasts all of the following: s, , ʰs, ʰsʰ, ts, ʰts, tsʰ, z, ʱz, dz, ʱdz, ⁿsʰ, ᵐtsʰ, ⁿtsʰ, ⁿdz, ᵖts, ᵖtsʰ, ᵖtsʷʰ, and ᶲs, and in addition it has four tones, but there is no single word that is distinguished by tone only. On top of that, there are 22 different vowels.

Dzongka gets a 5 rating, extremely hard to learn.


Vietnamese is also hard to learn because to an outsider, the tones seem hard to tell apart. Therefore, foreigners often make themselves difficult to understand by not getting the tone precisely correct. It also has “creaky-voiced” tones, which are very hard for foreigners to get a grasp on.

Vietnamese grammar is fairly simple, and reading Vietnamese is pretty easy once you figure out the tone marks. Words are short as in Chinese. However, the simple grammar is relative, as you can have 25 or more forms just for I, the 1st person singular pronoun. In addition, the Latin orthography is said to be quite bad. It was invented by missionaries a few centuries ago, and it has never made much sense.

Vietnamese gets 5 rating, extremely hard to learn.


Khmer has a reputation for being hard to learn. I understand that it has one of the most complex honorifics systems of any language on Earth. Over a dozen different words mean to carry depending on what one is carrying. There are several different words for slave depending on who owned the slave and what the slave did. There are 28-30 different vowels, including sets of long and short vowels and long and short diphthongs. The vowel system is so complicated that there isn’t even agreement on exactly what it looks like. Khmer learners, especially speakers of IE languages, often have a hard time producing or even distinguishing these vowels.

Speaking it is not so bad, but reading and writing it is pretty difficult. For instance, you can put up to five different symbols together in one complex symbol. The orthographic script is even worse than the Thai one. There are actually rules to this mess, but no one seems to know who they are.

Khmer gets a  4.5 rating, very to extremely hard.

North Bahnaric

Sedang, a language of Vietnam, has the highest number of vowel sounds of any language on Earth, at 55 distinct vowel sounds.

Sedang gets a 5 rating, extremely hard to learn.


Hmong is widely spoken in this part of California, but it’s not easy to learn. There are eight tones, and they are not easy to figure out. It’s not obviously related to any other major language but the obscure Mien.

It has some very strange consonants called voiceless nasals. We have them in English as allophones – the m in small is voiceless, but in Hmong, they put them at the front of words – the m in the word Hmong is voiceless. These can be very hard to pronounce.

The romanization is widely criticized for being a lousy one, but the Hmong use it anyway.

Hmong gets a 5 rating, extremely hard to learn.


Tsou is a Taiwanese aborigine language spoken by about 2,000 people in Taiwan. It has the odd feature whereby the underlying glides y and w turn into or surface as non-syllabic mid vowels e̯ and o̯ in certain contexts:

jo~joskɨ -> e̯oˈe̯oskɨ  -= fishes

Tsou is also ergative like most Formosan languages. Tsou is the only language in the world that has no prepositions or anything that looks like a preposition. Instead it uses nouns and verbs in the place of prepositions. Tsou allows more potential consonant clusters than most other languages. About 1/2 of all possible CC clusters are allowed.

Tsou has an inclusive/exclusive distinction in the 1st person plural and a very strange visible and non-visible distinction in the 3rd person singular and plural. Both adjectives and adverbs can turn into verbs and are marked for voice in the same way that verbs are. Verbs are extensively marked for voice. Nouns are marked for a variety of odd cases, often referring to perception, (visible/invisible) person, and place deixis.

‘e –               visible and near speaker
si/ta –           visible and near hearer
ta –               visible but away from speaker
‘o/to –           invisible and far away, or newly introduced to discourse
na/no ~ ne – non-identifiable and non-referential (often when scanning a class of elements)

Tsou gets a 5 rating, extremely hard to learn.


Bahasa Indonesia is an easy language to learn. For one thing, the grammar is dead simple. There are only a handful of prefixes, only two of which might be seen as inflectional. There are also several suffixes. Verbs are not marked for tense at all. And the sound system of these languages, in common with Austronesian in general, is one of the simplest on Earth, with only two dozen phonemes. Bahasa Indonesia has few homonyms, homophones, homographs, or heteronyms. Words in general have only one meaning.

Though the orthography is not completely phonetic, it only has a small number of nonphonetic exceptions. The orthography is one of the easiest on Earth to use.

The system for converting words into either nouns or verbs is regular. To make a plural, you simply repeat a word, so instead of saying pencils, you say pencil pencil.

Bahasa Indonesia gets a 1.5 rating, extremely easy to learn.

Malay is only easy if you learn the standard spoken form or one of the creoles. Learning the literary language is quite a bit more difficult. However, the Jawi script, which is Malay written in Arabic script, is often considered to be perfectly awful.

Malay get a 2 rating for moderately easy.

Greater Central Philippine
Central Philippine

However, Tagalog is much harder than Malay or Indonesian. Compared to many European languages, Tagalog syntax, morphology and semantics are often quite different. Also, Tagalog is typically spoken very fast. Unlike Malay, verbs conjugate quite a bit in Tagalog. The main idea of Tagalog grammar is something called focus. Once you figure that out, the language gets pretty easy, but until you understand that concept, you are going to have a hard time.

Everything is affixed in Tagalog.

However, articles and creation of adjectives from nouns is very easy.


gandabeauty (noun)
magandabeautiful (adjective)

Tagalog gets a 4 rating, very difficult.

Central-Eastern Malayo-Polynesian
Eastern Malayo-Polynesian
Central-Eastern Oceanic
Remote Oceanic
Central Pacific
East Fijian-Polynesian

Maori and other Polynesian languages have a reputation for being quite easy to learn. The main problem for English speakers is that the sentence structure is backwards compared to English. In addition, macrons can cause problems.

One problem with Maori is dialects. The dialects are so diverse that this means that there are multiple words for the same thing. Swiss German has a similar issue, with up to 50 words for each common household item (nearly every major dialect has its own word for common objects):

ngongi, noni, koki, waiwater
, rarangi, hiri –  to plait, to twist, to weave
, maitaigood
, , tutehu, mātikato stand
, mouto hold
, pouto be exhausted
, tohorāwhale
, ngohifish
, kāwailine
, kori, keukeu, koukou, neke, nukuto move
, hara, here, horo, whanoto go, to come
, hapa, to be wrong
, wānanga, rūnangato discuss
, tahungapriest
, maikukufinger nail
, konohi, mata, whatu, kamo, karueye, face

Entire Maori sentences can be written with vowels only.

E uu aau?
Are yours firm?

I uaa ai.
It rained as usual.

I ui au ‘i auau aau?’
E uaua!
It will be difficult/hard/heavy!

On the plus side, the pronunciation is simple, and there is no gender. The language is as regular as Japanese. No Polynesian language has more than 16 sounds, and they all lack tones. They all have five vowels, which can be either long or short. A consonant must be followed by a vowel, so there are no consonant clusters. All consonants are easy to pronounce.

Maori gets a 3 rating, average difficulty.


Hawaiian is a pretty easy language to learn. It is easy to pronounce, has a simple alphabet, lacks complex morphology and has a fairly simple syntax.

Hawaiian gets a 2 rating, very easy to learn.

North and Central Vanuatu
East Santo

Sakao is a very strange langauge spoken by 4,000 people in Vanuatu.  It is very strange. It is a polysynthetic Austronesian language, which is very weird. It allows extreme consonant clusters. Sakao has an incredible seven degrees of deixis. The language has an amazing four persons: singular, dual, paucal and plural. The neighboring language Tomoko has singular, dual, trial and plural. The trial form is very odd. Sakao’s paucal derived from Tomato’s trial:

they, from three to ten

jørðœl løn
the five of them
(Literally, they three, five)

All nouns are always in the singular except for kinship forms and demonstratives, which only display the plural:

ðjœɣmy mother/aunt -> rðjœɣmy aunts

walðyɣmy child -> raalðyɣmy children

It has a number of nouns that are said to be “inalienably possessed”, that is, whenever they occur, they must be possessed by some possessor. These often take highly irregular inflections:

Sakao 	  English
œsɨŋœ-ɣ   my mouth
œsɨŋœ-m   thy mouth
ɔsɨŋɔ-n   his/her/its mouth
œsœŋ-...  ...'s mouth	

uly-ɣ 	  my hair
uly-m 	  thy hair
ulœ-n 	  his/her/its hair
nøl-...   ...'s hair

Here, mouth is either œsɨŋœ-, ɔsɨŋɔ- or œsœŋ-, and hair is either uly-, ulœ- or nøl-

Sakao, strangely enough, may not even have syllables in the way that we normally think of them. If it does have syllables at all, they would appear to be at least a vowel optionally  surrounded by any number of consonants.

i (V)

Having sung and stopped singing thou kept silent.

Sakao has a suffix -in that makes an intransitive verb transitive and makes a transitive verb ditransitive. Ditransitive verbs can take two arguments – a direct object and an instrumental.

Mɨjilɨn amas ara./Mɨjilɨn ara amas.
He kills the pig with the club
/He kills with the club the pig.

Sakao polysynthesis allows compound verbs, each one having its own instrument or object:

Mɔssɔnɛshɔβrɨn aða ɛðɛ.
He-shooting-fish-kept-on-walking with-a-bow the-sea.
He walked along the sea shooting the fish with a bow.

Sakao gets a 5 rating, extremely hard to learn.

Central-Eastern Oceanic
Southeast Solomonic
Malaita–San Cristobal
Northern Malaita

Kwaio is an Austronesian language spoken in the Solomon Islands. It has four different forms of number to mark pronouns – not only the usual singular and plural, but also the rarer dual and the very rare paucal. In addition, there is an inclusive/exclusive contrast in the non-singular forms.

For instance:

1 dual inclusive (you and I)
1 dual exclusive (I and someone else, not you)

1 paucal inclusive (you, I and a few others)
1 paucal exclusive (I and a few others)

1 plural inclusive (I, you and many others)
1 plural exclusive (I and many others)

Pretty wild!

Kwaio gets a 5 rating, extremely hard to learn.

Greater Barito
East Barito

Malagasy, the official language of Madagascar, has a reputation for being even easier to learn than Indonesian or Malay.

Malagasy gets a 1 rating, easiest of all to learn.


Thai is a pretty hard language to learn. There are 75 symbols in the strange script, there are no spaces between words in the script, and vowels can come before, after, above or below consonants in any given syllable. There seem to be many different glyphs for every consonant, but the different glyphs for the same consonant will sometimes change the sound of the neighboring vowel. The orthography is as insensible as that of English since centuries have gone by with no spelling reforms, in fact, Thai has not changed its system in 1000 years. The wild card of having tone thrown in adds to the insanity.

Consonant pronunciations vary depending on the location of the syllable in the word – for instance, s can change to t. There are many vowels which are spoken but not written. There are many consonants that are pronounced the same – for instance, there are six different t‘s, not counting the s‘s that turn into t‘s. The Thai script is definitely one of the most difficult phonetic scripts. Nevertheless, the Thai script is easier to learn than the Japanese or Chinese character sets. In spite of all of that, the syntax is simple, like Chinese.

There are five tones, including a neutral tone. Tones are determined by a variety of complex things, including a combination of tone marks, the class of consonants, if the syllable ends in a sonorant or a stop and what the tone of the preceding syllable was. Tone marking in the orthography is quite complex.

The vowels are different than in many languages, and there are some unusual diphthongs: eua, euai, aui and uu. There is a contrast between aspirated and unaspirated consonants.

There is a system of noun classifiers for counting various things, similar to Japanese. In addition, common to many Asian languages, there is a complicated honorifics system.

On the plus side, Thai is a regular language, with few exceptions to the rules. However, the rules are quite complex. The syntax is about as complex as that of Chinese, and the grammar is dead simple.

Thai gets a 5 rating, hardest of all to learn.

Lao is very similar to Thai, in fact it is identical to a Thai language spoken by 16 million people in northeast Thailand called Northeastern Thai. The Lao script is similar to Thai, but it has fewer letters so there is somewhat less confusion.

Lao gets a 4.5 rating, very to extremely hard to learn.


The Kam languages of the Dong people in southwest China were rated by the Fudan University study referenced above under Wu as the 2nd most phonologically complex on Earth (Wang 2012). There are 32 stem initial consonants, including oddities like , tɕʰ, , pʲʰ, ɕ, , kʷʰ, ŋʷ, tʃʰ, tsʰ. Note the many contrasts between aspirated and unaspirated voiceless consonants, including bilabial palatalized stops, labialized velar stops, and alveolar affricates. There are an incredible 64 different syllable finals, and 14 others that occur only in Chinese loans.

There are an astounding 15 different tones, nine in open syllables and six in checked syllables (entering tones). Main tones are high, high rising, high falling, low, low rising, low falling, mid, dipping and peaking. When they speak, it sounds as if they are singing.

Kam gets a 5 rating, extremely hard to learn.


According to the Fudan University study quoted above, Buyang in the 3rd most phonologically complex language in the world. Buyang is a cluster of 4 related languages spoken by 1,900 people in Yunnan Province, China. Buyang has a completely wild consonant inventory.

It has a full set of both voiced and voiceless plain and aspirated stops, including voiceless uvulars. The contrast between aspirated and plain voiced stops is peculiar. The stop series also has distinctions between palatalized and rounded stops throughout the series. It has a labialized voiceless palatal fricative and a voiceless dental aspirated lateral, unusual sounds. It has four different voiceless aspirated nasals. It has voiceless y and w, more odd sounds. It also has plain and labialized palatal glides.

That is one heck of a wild phonology.

Buyang gets a 5 rating, extremely hard to learn.


The African Bantu language Ga has a bad reputation for being a tough nut to crack. It is spoken in Ghana by about 600,000 people. It has two tones and engages in a strange behavior called tone terracing that is common to many West African languages. There is a phonemic distinction between three different types of vowel length. All vowels have 3 different lengths – short, long and extra long. It also has many sounds that are not in any Western languages.

Ga gets a 5 rating, extremely hard to learn.

Central Bia

Anyi is a language spoken by 610,000 people in Côte d’Ivoire.  It is relatively straightforward as far as African languages go. Probably the hardest part about the language is that it is tonal, and it does have two tones. The phonology does have the unusual +-ATR contrast which will seem very odd. ATR stands for advanced tongue root, so the language has a contrast between vowels with an advanced tongue root and without one. However, the grammar is pretty regular. There are few confusing phonological processes.

Anyi has a simple tense system, with only present, past and future. There is no aspect, mood or voice marking, and it lacks the noun class systems so common in many African languages. It has a plural marker, but it is often optional.

The syntax does have serial verbs, which will seem odd to Westerners. It distinguishes between relative clauses marked with and subordinate clauses marked with .

Anyi gets a 4 rating, very hard to learn.

Narrow Bantu

Ndali is a Bantu language with 150,000 speakers spoken in Malawi and Tanzania. It has many strange tense forms. For instance, in the past tense:

Past tense A: He went just now.
Past tense B: He went sometime earlier today.
Past tense C: He went yesterday.
Past tense D: He went sometime before yesterday.

Future tense is marked similarly:

Future tense A: He’s going to go right away.
Future tense B: He’s going to go sometime later today.
Future tense C: He’s going to go tomorrow.
Future tense D: He’s going to go sometime after tomorrow.

Ndali gets a 5 rating, extremely hard to learn.


Xhosa, a language of South Africa, is quite difficult, with up to nine click sounds. Clicks only exist in one language outside of Africa – the Australian language Damin – and are extremely difficult to learn. Even native speakers mess up the clicks sometimes. Nelson Mandela said he had problems making some of the click sounds in Xhosa. The phonemics in general of Xhosa are pretty wild.

Xhosa gets a 5 rating, extremely hard to learn.

Zulu and Ndebele also have these impossible click sounds. However, outside of click sounds, the phonology of Nguni languages is straightforward. All Nguni languages are agglutinative. These languages also make plurals by changing the prefix of the noun, and the manner varies according the noun class. If you want to look up a word in the dictionary, first of all you need to discard the prefix. For instance, in Ndebele,

imifula, but

–  amatsheyet


Ndebele gets a 5 rating, hardest of all.

Zulu has pitch accent, tones and clicks. There are nine different pitch accents, four tones and three clicks, but each click can be pronounced in five different ways. However, tones are not marked in writing, so it’s hard to figure out when to use them. Zulu also has depressor consonants, which lower the tone in the vowel in the following syllable. In addition, Zulu has multiple gender – 15 different genders. And some nouns behave like verbs. It also has 12 different noun classes, but 90% of words are part of a group of only three of those classes.

Zulu gets a 5 rating, extremely hard to learn.


For unknown reasons, Swahili is generally considered to be an easy language to learn. The US military ranks it 1, with the easiest of all languages to learn. This seems to be the typical perception. Why Swahili is so easy to learn, I am not sure. It’s a trade language, and trade languages are often fairly easy to learn. There’s also a lot of controversy about whether or not Swahili can be considered a creole, but that has not been proven. For the moment, the reasons why Swahili is so easy to learn will have to remain mysterious.

On the down side, Swahili has many noun classes, but they have the benefit of being more or less logical.

Swahili gets a 2 rating, moderately easy.

Southern Africa

!Xóõ (Taa), spoken by only 4,200 Bushmen in Botswana and Namibia, is a notoriously difficult Khoisan language replete with the notoriously impossible to comprehend click sounds. Taa has anywhere from 130 to 164 consonants, the largest phonemic inventory of any language. Of this vast wealth of sounds, there are anywhere from 30-64 different click sounds. There are five basic clicks and 17 accompanying ones. Speakers develop a lump on their larynx from making the click sounds.

In addition, there are four types of vowels: plain, pharyngealized, breathy-voiced and strident. On top of that, there are four tones. Taa appears on many lists of the wildest phonologies and craziest languages period on Earth.

Taa gets a 5 rating, extremely hard to learn.


Ju|’hoan, a Khoisan language spoken by 5,000 people in Botswana, has one of the wildest phonological inventories on Earth. The voiced aspirated consonants – sb͡pʰd͡tʰ , d͡tsʰ , d͡tʃʰ , ɡ͡kʰ , and ᶢǃʰ  – are particularly odd. Some question whether these segments actually exist and say that they are instead spoken with a “breathy-voice.” However, voiced aspirated consonants do appear to be real. In addition, Ju|’hoan has a closed class of only 17 adjectives since descriptive functions are done by verbs. They are the following:

(those remaining)
other (strange)
a certain

the numbers one through four

Ju|’hoan scored very high on a study of the weirdest languages on Earth.

Ju|’hoan gets a 5 rating, extremely hard to learn.


Inuktitut is extremely hard to learn. Inuktitut is polysynthetic-agglutinative, and roots can take many suffixes, in some cases up to 700. Verbs have 63 forms of the present indicative, and conjugation involves 252 different inflections. Inuktitut has the complicated polypersonal agreement system discussed under Georgian above and Basque below. In a typical long Inuktitut text, 92% of words will occur only once. This is quite different from English and many other languages where certain words occur very frequently or at least frequently. Certain fully inflected verbs can be analyzed both as verbs and as nouns. Words can be very long.

I truly don’t know how to speak Inuktitut very well.

You may need to analyze up to 10 different bits of information in order to figure out a single word. However, the affixation is all via suffixes (there are no prefixes or infixes) and the suffixation is extremely regular.

Inuktitut is also rated one by linguists one of the hardest languages on Earth to pronounce. Inuktitut may be as hard to learn as Navajo.

Inuktitut is rated 6, hardest of all.

Kalaallisut (Western Greenlandic) is very closely related to Inuktitut. Look at this sentence:

However, they will say that he is a great entertainer, but …

That word is composed of 12 separate morphemes. A single word can conceptualize what could be an entire sentence in a non-polysynthetic language.

Kalaallisut is rated 6, hardest of all.


Chukchi is a polysynthetic, agglutinating and incorporating language and is often listed as one of the hardest languages on Earth to learn.

I have a fierce headache.

There are five morphemes in that word, and there are three lexical morphemes (nouns or adjectives) incorporated in that word: meyŋgreat, levthead, and pəγtache.

Chukchi gets a 6 rating, hardest of all.


Basque, of course, is just a wild language altogether. There is an old saying that the Devil tried to learn Basque, but after seven years, he only learned how to say Hello and Goodbye. Many Basques, including some of the most ardent Basque nationalists, tried to learn Basque as adults. Some of them succeeded, but a very large number of them failed. Based on the number that failed, it does seem that Basque is harder for an adult to learn as an L2 than many other languages are. Basque grammar is maddeningly complex and it often makes it onto craziest grammars and craziest language lists.

There are 11 cases, and each one takes four different forms. The verbs are quite complex. This is because it is an ergative language, so verbs vary according to the number of subjects and the number of objects and if any third person is involved.

This is the same polypersonal agreement system that Georgian has above. Basque’s polypersonal system is a polysynthetic system consisting of two verb types – synthetic and analytical. Only a few verbs use the synthetic form.

Three of Basque’s cases – the absolutive (intransitive verb case), the ergative (intransitive verb case) and the dative – can be marked via affixes to the verb. In Basque, only present simple and past simple synthetic tenses take polypersonal affixes.

The analytical forms are composed of more than one word, while the synthetic forms are all one word. The analytic verbs are built via the synthetic verbs izanbe, ukanhave and egindo.


d-akar-ki-o-gu = We bring it to him/her. The verb is ekarribring.
z-erama-zki-gu-te-n = They took them to us. The verb is eramantake


Ekarriko d-i-o-gu = We’ll bring it to him/her. Literally: We will have-bring it to him/her. The analytic verb is built from ukanhave.

Eraman d-ieza-zki-gu-ke-te = They can take them to us. Literally: They can be taking them to us. The analytic verb is built from izanbe.

Most of the analytic verbs require an auxiliary which carries all sorts of information that is often carried on verbs in other languages – tense, mood, sometimes gender and person for subject, object and indirect object.

Jaten naiz.
Eat I-am-doing.
I am eating.

Jaten nintekeen.
Eat I-was-able-to.
I could eat.

Eman geniezazkiake.
Give we-might-have-them-to-you-male.
We might have given them to you.

In the above, naiz, nintekeen and geniezazkiake are auxiliaries. There are actually 2,640 different forms of these auxiliaries!

A language with ergative morphosyntax in Europe is quite a strange thing, and Basque is the only one of its kind. The ergative itself is quite unusual:

Gizona etorri da.The man has arrived.
Gizonak mutila ikusi du.
The man saw the boy.

= the

The noun gizon takes a different form whether it is the subject of a transitive or intransitive verb. The first sentence is in absolutive case (unmarked) while the second sentence is in the ergative case (marked by the morpheme -k). If you come from a non-ergative IE language, the concept of ergativity itself is difficult enough to conceptualize, much less trying to actually learn an ergative language. Consequently, any ergative language will automatically be more difficult than a non-ergative one for all speakers of IE languages.

Ergativity also works with pronouns.  There are four basic systems:

Nor:           verb has subject only
Nor-Nork:          "    subj. + direct complement
Nor-Nori:          "    subj. + indirect comp.
Nor-Nori-Nork:     "    subj. + indir. + dir. comps.

Some call Basque the most consistently ergative language on Earth.

If you don’t grow up speaking Basque, it’s hard to attain native speaker competence. It’s quite a bit easier to write in Basque than to speak it.

Nevertheless, Basque verbs are quite regular. There are only a few irregularities in conjugations and they have phonetic explanations. In fact, the entire language is quite regular. In addition, most words above the intermediate level are borrowings from large languages, so once you reach intermediate Basque, the rest is not that hard. In addition, pronunciation is straightforward.

Basque is rated 5.5, nearly hardest of all.


Dorani, Yakir. Hebrew speaker, Israel. August 2013. Personal communication.

Hewitt, B. G.. 2005. Georgian: A Learner’s Grammar, p. 29.

Kim, Yuni. December 16, 2003. Vowel Elision and the Morphophonology of Dominance in Aymara. UC Berkeley.

Kirk, John William Carnegie. 1905. A Grammar of the Somali Language: With Examples in Prose and Verse and an Account of the Yibir and Midgan Dialects, pp. 73-74.

Rogers, Jean H. 1978. Differential Focusing in Ojibwa Conjunct Verbs: On Circumstances, Participants, and Events. International Journal of American Linguistics 44: 167-179.

Wang, Chuan-Chao et al. 2012. Comment on ”Phonemic Diversity Supports a Serial Founder Effect Model of Language Expansion from Africa.” Science 335:657.

This research takes a lot of time, and I do not get paid anything for it. If you think this website is valuable to you, please consider a a contribution to support more of this valuable research.


Filed under !Xóõ, Afroasiatic, Algonquian, Altaic, Arabic, Austro-Asiatic, Austro-Tai, Austronesian, Bahasa Indonesian, Bakjalukasha, Bantu, Basque, Cantonese, Cherokee, Chinantec, Chinese language, Chukchi, Chukotko-Kamchatkan, Cree, Dene-Yenisien, Descriptive, Dravidian, Eskimo-Aleut, Finnic, Finnish, Finno-Ugric Languages, Hebrew, Hmong, Hmong-Mien, Hopi, Hungarian, Inuktitut, Iriquoian, Isolates, Japanese, Japonic, Khmer, Khoisan, Kootenai, Korean language, Language Families, Language Learning, Language Samples, Linguistics, Malayalam, Malayo-Polynesian, Malaysian, Maltese, Mandarin, Maori, Min Nan, Mon-Khmer, Na-Dene, Navajo, NE Caucasian, Nguni, Niger-Congo, Niger-Kordofanian, Nuxálk, Oghuz, Ojibwa, Oto-Manguean, Paleosiberian, Philippine, Quechua, Quechuan, Salishan, Semitic, Sinitic, Sino-Tibetan, Slavey, Tabasaran, Tai-Kadai, Tamil, Tsez, Turkic, Turkish, Ugric, Vietnamese, Xhosa, Yamana

More On The Hardest Languages To Learn – Indo-European Languages

Caution: This post is very long! It runs to 184 pages on the Web. Updated November 25, 2016.

This post will deal with how hard it is for English speakers to learn other IE languages. The English section will necessarily deal with how hard it is for non-English speakers to learn English, and as such will be less scientific. Nevertheless, there are certain things about English that tend to cause problems for many, such as phrasal verbs.

We did a post on this earlier, but it looks like we only scratched the surface. There are many webpages on this topic, and one could read about the subject for a long time, but after a while, things start getting repetitive.

This post is very good. There are more in various places on the Web.

For starters, before we do our own analysis, let’s look at what some other people came up with. This post is very good. They did a survey, and the post describes the results of the survey.

According to the survey, the nine hardest languages to learn overall were Mandarin, Hungarian, Finnish, Polish, Arabic, Hindi, Icelandic, German and Swedish.

The eight hardest languages to speak (or to pronounce correctly, specifically) were French, Mandarin, Polish, Korean, Hungarian, Arabic, Basque and Hindi.

The nine hardest languages to write were Arabic, Mandarin, Polish, French, Serbo-Croatian, Japanese, Russian, Basque and English.

How does that survey line up with the facts? Surveys are just opinions of L2 learners, and carry variant validity. For starters, let’s throw Swedish off the list altogether, as it actually seems to be a pretty easy language to learn. It’s interesting that some people find it hard, but the weight of the evidence suggests that more folks find it easy than difficult.

Mandarin, Arabic, Japanese and Russian of course use different alphabets and this is why they were rated as hard to write.

Method. 42 IE languages were examined. A literature survey, combined with interviews of various L2 language learners was conducted. In addition, 100 years of surveys on the question by language instructors was reviewed. The US military’s School of Languages in Monterey’s ratings system for difficulty of learning various languages was analyzed.

Results were collated in an impressionistic manner along a majority rules line in order to form final opinions. For example, a minority said that Portuguese or Spanish were very hard to learn, but the consensus view was that they were quite easy. In this case, the minority opinion was rejected, and the consensus view was adopted. The work received a tremendous amount of criticism, often hostile to very hostile, after publication, and many changes were made to the text.

Clearly, such a project will necessarily be more impressionistic than scientific. Scientific tests of the relative difficulty of learning different languages will have to await the development of algorithms specifically designed to measure such things. And even then, surely there will be legions of “We can’t prove anything” naysayers, as this is the heyday of the “We can’t prove anything” School of Physics Envy in Linguistics.

One common criticism was, “In Linguistics, the standard view is that there is no such thing as an easy or difficult language to learn. All languages are equally difficult or easy to learn.” Unless we are talking about children learning an L1 (and even then that’s a dubious assertion), this statement was rejected as simply untrue and exemplar of the sort of soft science (“We can’t prove anything about anything”) mushiness that has overtaken Linguistics in recent years.

Sociolinguistics and Applied Linguistics have long been nearly ruined by soft science mushiness, and in recent years, soft science “We can’t prove anything” muddleheadedness has overtaken Historical Linguistics in a horrible way. Bizarrely enough, this epidemic of Physics Envy has been clouded, as one might suspect, in claims of rigorous application of the scientific method.

But hard sciences prove things all the time. Whenever a field claims that almost nothing in the field is provable, you’re heading in the realms of Politically Correct soft science Humanities brain mush.

Results. A ratings system was designed in terms of how difficult it would be for an English-language speaker to learn the language. In the case of English, English was judged according to how hard it would be for a non-English speaker to learn the language. Speaking, reading and writing were all considered.

Ratings. Languages were rated 1-5 based on difficulty for an English speaker, easiest to hardest. 1 = easiest, 2 = moderately easy to average, 3 = average to moderately difficult, 4 = very difficult, 5 = most difficult of all.

Time needed. Time needed for an English language speaker to learn the language “reasonably well”: Level 1 languages = 3 months-1 year. Level 2 languages = 6 months-1 year. Level 3 languages = 1-2 years. Level 4 languages = 2 years. Level 5 languages = 3-4 years, but some may take longer.

Conclusion. The soft science, Politically Correct mush-speak from the swamps of Sociolinguistics currently in vogue, “All languages are equally difficult or easy for any adult to learn,” was rejected. The results of this study indicate that languages to indeed differ dramatically in how difficult they are for L2 English language learners.




Ind0-Ayran languages like Kashmiri, Hindi and especially Sanskrit are quite hard, and Sanskrit is legendary for its extreme complexity.

Central Zone
Western Hindi

The Hindi script is quite opaque to Westerners, some of whom say that Chinese script is easier. You speak one way if you are talking to a man or a woman, and you also need to take into account whether you as speaker are male or female. Gender is also as prominent as in Spanish; you have to remember whether any given noun is masculine or feminine.Hindi is definitely an IE language by its rich system of gender, case and number inflection.

The most difficult aspects of Hindi are the pronunciation and the case system. In addition, Hindi is split ergative, and not only that, but it actually has a tripartite ergative system, and the ergativity is split by tense like in Persian.

The distinction between aspirated/unaspirated and alveolar/retroflex consonants is hard for many to make. There is a four-way distinction ion the t and d sounds with aspirated/unaspirated dental and aspirated/unaspirated retroflex t‘s and d‘s. The are three different r sounds – one that sounds like the English r and two retroflex r‘s that are quite hard to make or even distinguish, especially at the end of a word. Hindi also has nasalized vowels.

If you come from a language that has case, Hindi’s case system will not be overly difficult.

In addition, there is a completely separate word for each number from 1-100, which seems unnecessarily complicated.

However, Hindi has a number of cognates with English. I am not sure if they are Indic loans into English or they share a common root going back to Proto-Indo-European (PIE).

loot plunder/destroy, English loot.
season/weather, English equivalent is monsoon
storm, English equiv. typhoon
d – something tied around the waist, English equiv. cummerbund
– literally bad name, means bad reputation. These are both cognates to the English words bad and name.
bangalaahouse, English equiv. bungalow
priest, English equiv. pundit

Nevertheless, Hindi typically gets a high score in ratings of difficult languages to learn. Based on this high score across multiple surveys, we will give it a relatively high rating.

Hindi is rated 4, very hard to learn.

Punjabi is probably harder than any other Indic language in terms of phonology because it uses tones. It’s like Hindi with tones. It has either two or three tones: high or high-falling, low or low-rising and possibly a neutral or mid tone. It is very odd for an IE language to have tones.

Punjabi is rated 4.5, very difficult.

Eastern Zone

Bengali is similar to Hindi, but it lacks grammatical gender, and that fact alone is said to make it much easier to learn. Bengali speak tend to make stereotypical gender errors when speaking in Hindi. Nevertheless, it uses the Sanskrit alphabet, and that alone makes it hard to read and write.

Bengali is rated 3.5, harder than average to learn.

Northern Zone
Eastern Pahari

Nepali is a very difficult language to learn as it has a complex grammar. It has case not for nouns themselves but for clause constituents. It has tense, aspect, and voice. Nepali has an unbelievable 11 noun classes or genders, and affixes on the verb mark the gender, number and person of the subject. It even has split ergativity, strange for an IE language.

Nepali has the odd feature, like Japanese, of having verbs have completely different positive and negative forms.

~ hoina (I am ~ I am not)
chas ~ chainas (you (intimate) are ~ you are not)
bolchu ~ boldina (I speak ~ I don’t speak)

Note the extreme differences on the conjugation of the present tense of the verb to be between 1 singular and 2 familiar singular. They look nothing like each other at all.

Adjectives decline in peculiar way. There is an inflection on adjectives that means “qualified.” So can say this by either inflecting the adjective:

dublo ~ dublai (tall ~ quite tall)
hoco ~ hocai (short ~ rather short)
rāmro ~ rāmrai (nice ~ nice enough)

or by putting the invariant qualifying adverb in front of the adjective:

ali dubloquite tall
ali hocorather short
ali rāmronice enough

Nepali gets a 4.5 rating, very difficult.

Northwestern Zone


Sinhala is also difficult but it is probably easier than most other languages in the region.

Sinhala is rated 3, average difficulty.


Sanskrit is legendary for its difficulty. It has script that goes on for long sequences in which many small individual words may be buried. You have to take apart the sequences to find the small words. However, the words are further masked by tone sandhi running everything together. Once you tease the sandhi apart, you have to deal with hundreds of compound characters in the script. Once you do those two things, you are left with eight cases, nine declensions, dual number and other fun things.

Even native speakers tend to make grammatical mistakes are admit that parts of the grammar are fiendishly difficult. There are many grammatical features that are rarely or never found in any other language. Noun declension is based on the letter than the noun ends in, for instance, nouns that end in a, e or u all decline differently. There are three genders for nouns, and those all decline differently also. Each noun has eight cases and three numbers (singular, dual and plural) so there are 24 different forms for each noun. Counting the different combinations of endings and genders (all subsumed into a sort of noun class system) there are 20 different “noun classes.”

Combining the “noun classes” with the three genders, you end up with 1,440 different regular forms that nouns can take. To make matters worse, some of the cases have different forms themselves. And there are some exceptions to these rules. The I and you pronouns decline differently, but pronouns are simple compared to nouns.

For the verbs, each verb had exist in 10 different forms of tense or mood (one from Vedic Sanskrit is no longer used). There are six tenses and four moods. The six tenses are: one present tense, two future tenses and three past tenses. The moods are: imperative, dubitive (expresses uncertainty), optative (expresses hope or offers a benediction) and a form that expresses the concept if only, then… There are two different conjugations based on who is the beneficiary of the action, you or others. There are ten different classes of verbs, each of which conjugates differently. Additionally, each verb has a different form in the singular, dual and plural and in the 1st, 2nd and 3rd persons.

Once you get past all of that, you are ready to take on the really difficult parts of the language, participles, noun derivatives and agglutination, each of which is far more complicated than the above. To add insult to injury, Sanskrit has pitch accent.

Nevertheless, the language is so mathematically precise and regular that some have said it is a perfect language for computer programming. There may not be a single irregularity in the whole language.

Sanskrit is rated 5, extremely difficult.

Western Iranian
Southwestern Iranian


Persian is easier to learn than its reputation, as some say this is a difficult language to learn. In truth, it’s difficulty is only average, and it is one of the easier IE languages to learn. On the plus side, Persian has a very simple grammar and it is quite regular. It has no grammatical gender, no case, no articles, and adjectives never change form. Its noun system is as easy as that of English. The verbal system is a bit harder than English’s, but it is still much easier than that of even the Romance languages. The phonology is very simple.

On the down side, you will have to learn Arabic script. There are many lexical borrowings from Arabic which have no semantic equivalents in Persian.

English: two (native English word) ~ double (Latin borrowing)
Note the semantic transparency in the Latin borrowing.

Persian: do (native Persian word) ~ tasneyat (Arabic borrowing)
Note the utter lack of semantic correlation in the Arabic borrowing.

Some morphology was borrowed as well:

library (has an Arabic broken plural)

It is a quite easy language to learn at the entry level, but it is much harder to learn at the advanced level, say Sufi poetry, due to difficulty in untangling subtleties of meaning.

Persian gets a 3 rating as average difficulty.

Northwestern Iranian

Kurdish is about as hard to learn as Persian, but it has the added difficulty of pharyngeals, which are very hard for English speakers to make. Like Persian, it is no gender or case, and it also has a tense split ergative system.

Kurdish gets a 4 rating, very hard to learn.

Eastern Iranian

Ossetian is a strange Iranian language that has somehow developed ejectives due to proximity of Caucasian languages which had them. An IE language with ejectives? How odd.

Ossetian gets a 4 rating, very hard to learn.


Italian is said to be easy to learn, especially if you speak a Romance language or English, but learning to order a pizza and really mastering it are two different things. Foreigners usually do not learn Italian at anywhere near a native level.

For instance, Italian has three types of tenses – simple, compound, and indefinite.

There are also various moods that combine to take tense forms – four subjunctive moods, two conditional moods, two gerund moods, two infinite moods, two participle moods and one imperative mood.

There are eight tenses in the indicative mood – recent past, remote pluperfect, recent pluperfect, preterite (remote past), imperfect, present, future, future perfect. There are four tenses in the subjunctive mood – present, imperfect, preterite and pluperfect. There are two tenses in the conditional mood – present and preterite. There is only one tense in the imperative mood – present. Gerund, participle and infinite moods all take only present and perfect tenses.

Altogether, using these mood-tense combinations, any Italian verb can decline in up to 21 different ways. However, the truth is that most Italians have little understanding of many of these tenses and moods. They do not know how to use them correctly. Hence they are often only used by the most educated people. So an Italian learner does not really need to learn all of these tenses and moods.

Italian has many irregular verbs. There are 600 irregular verbs with all sorts of different irregularities. Nevertheless, it is a Romance language, and Romance has gotten rid of most of its irregularity. The Slavic languages are much more irregular than Romance.

Counterintuitively, some Italian words are masculine in the singular and feminine in the plural. There are many different ways to say the:





Few Italians even write Italian 100% correctly. However, there is no case in Italian, as in all of Romance with the exception of Romanian.

Italian is still easier to learn than French – for evidence see the research that shows Italian children learning to write Italian properly by age six, 6-7 years ahead of French children. This is because Italian orthography is quite sensible and coherent, with good sound-symbol correspondence. Nevertheless, the orthography is not as transparent as Spanish’s.

In a similar sense, Italian changes the meaning of verbs via addition of a verbal prefix:





In these cases, you create completely new verbs via the addition of the verbal prefix to the base. Without the prefix, it is a completely different verb.

Like German and French, Italian forms the auxiliary tense with two different words: avere and essere. This dual auxiliary system is more difficult than French’s and much more difficult than German’s.

Italian is somewhat harder to learn than Spanish or Portuguese but not dramatically so. Italian has more irregularities than those two and has different ways of forming plurals, including two different ways of forming plurals that can mean different things depending on the context. This is a leftover from the peculiarities of the Latin neutral gender. The rules about when plurals end in -io or -e are opaque.

In addition, Italian pronouns and verbs are more difficult than in Spanish. Grammar rules in Spanish are simpler and seem more sensible than in Italian. Italian has the pronominal adverbs ne and se. Their use is not at all intuitive, however, they can be learned with a bit of practice.

Italian pronunciation is a straightforward, but the ce and ci sounds can be problematic. The only sounds that will give you trouble are r, gl and gn.

Italian gets a 3.5 rating, average difficulty.

Often thought to be an Italian dialect, Neapolitan is actually a full language all of its own. In Italy, there is the Neapolitan language and Neapolitan Italian, which is a dialect or “accent” of Italian. Many Italians speak with a Neapolitan accent, and it is easy for non-Neapolitans to understand. However, the Neapolitan language is a a full blown language and is nearly incomprehensible to even speakers of Standard Italian.Neapolitan is said to be easier than Standard Italian. Unlike Italian, Neapolitan conjugation and the vocative are both quite simple and any irregularities that exist seem to follow definite patters.

Neapolitan gets a 2.5 rating, fairly easy.

Western Romance

French is pretty easy to learn at a simple level, but it’s not easy to get to an advanced level. For instance, the language is full of idioms, many more than your average language, and it’s often hard to figure them out.

One problem is pronunciation. There are many nasal vowels, similar to Portuguese. The eu, u and all of the nasal vowels can be Hell for the learner. There is also a strange uvular r. The dictionary does not necessarily help you, as the pronunciation stated in the dictionary is often at odds with what you will find on the street.

There are phenomena called élision, liaison and enchainement, which is similar to sandhi in which vowels elide between words in fast speech. There are actually rules for this sort of thing, but the rules are complicated, and at any rate, for liaisons at least, they are either obligatory, permitted or forbidden depending on the nature of the words being run together, and it is hard to remember which category various word combinations fall under.

The orthography is also difficult since there are many sounds that are written but no longer pronounced, as in English. Also similar to English, orthography does not line up with pronunciation. For instance, there are 13 different ways to spell the o sound: o, ot, ots, os, ocs, au, aux, aud, auds, eau, eaux, ho and ö.

In addition, spoken French and written French can be quite different. Spoken French uses words and phrases such as c’est foututhe job will not be done, and on which you might never see in written French.

The English language, having no Language Committee, at least has an excuse for the frequently irrational nature of its spelling.

The French have no excuse, since they have a committee that is set up in part to keep the language as orthographically irrational as possible. One of their passions is refusing to change the spelling of words even as pronunciation changes, which is the opposite of what occurs in any sane spelling reform. So French is, like English, frozen in time, and each one has probably gone as long as the other with no spelling reform.

Furthermore, to make matters worse, the French are almost as prickly about writing properly as they are about speaking properly, and you know how they are about foreigners mangling their language.

Despite the many problems of French orthography, there are actually some rules running under the whole mess, and it is quite a bit more sensible than English orthography, which is much more chaotic.

French has a language committee that is always inventing new native French words to keep out the flood of English loans. They have a website up with an official French dictionary showing the proper native coinages to use. Another one for computer technology only is here.

On the plus side, French has a grammar that is neither simple nor difficult; that, combined with a syntax is pretty straightforward and a Latin alphabet make it relatively easy to learn for most Westerners. In addition, the English speaker will probably find more instantly recognizable cognates in French than in any other language.

A good case can be made that French is harder to learn than English. Verbs change much more, and it has grammatical gender. There are 15 tenses in the verb, 18 if you include the pluperfect and the Conditional Perfect 2 (now used only in Literary French) and the past imperative (now rarely used). That is quite a few tenses to learn, but Spanish and Portuguese have similar situations.

A good case can be made that French is harder to learn than Italian in that French children do not learn to write French properly until age 12-13, six years after Italian children.

Its grammar is much more complicated than Spanish’s. Although the subjunctive is more difficult in Spanish than in French, French is much more irregular. Like German, there are two different ways to form the auxiliary tense to have. In addition, French uses particles like y and en that complicate the grammar quite a bit.

French is one of the toughest languages to learn in the Romance family.  In many Internet threads about the hardest language to learn, many language learners list French as their most problematic language.

This is due to the illogical nature of French spelling discussed above such that the spelling of many French words must be memorized as opposed to applying a general sound-symbol correspondence rule. In addition, French uses both acute and grave accents – `´.

French gets a 3.5 rating for more than average difficulty.


Spanish is often said to be one of the easiest languages to learn, though this is somewhat controversial. Personally, I’ve been learning it off and on since age six, and I still have problems, though Spanish speakers say my Spanish is good, but Hispanophones, unlike the French, are generous about these things.

It’s quite logical, though the verbs do decline a lot with tense and number, and there are many irregular verbs, similar to French.

Compare English declensions to Spanish declensions of the verb to read.


I read
He reads


Yo leo
Tu lees
El lee
Nosotros leemos
Vosotros leéis
Ellos leen
pudísteis haber leído
hubiéremos ó hubiésemos leído

Nevertheless, Romance grammar is much more regular than, say, Polish, as Romance has junked most of the irregularity. Spanish has the good grace to lack case, spelling is a piece of cake, and words are spoken just as they are written. However, there is a sort of case left over in the sense that one uses different pronouns when referring to the direct object (accusative) or indirect object (dative).

Spanish is probably the most regular of the Romance languages, surely more regular than French or Portuguese, and probably more regular than Italian or Romanian. Pluralization is very regular compared to say Italian. There are generally only two plurals, -s and -es, and the rules about when to use one or the other are straightforward. There is only one irregular plural:

hipérbaton -> hipérbatos

This is in reference to a literary figure and you would never use this form in day to day speech.

The trilled r in Spanish often hard for language learners to make.

There is a distinction in the verb to be with two different forms, ser and estar. Non-native speakers almost never learn the use these forms as well as a native speaker. The subjunctive is also difficult in Spanish, and L2 learners often struggle with it after decades of learning.

Spanish pronunciation is fairly straightforward, but there are some sounds that cause problems for learners: j, ll, ñ, g, and r.

One good thing about Spanish is Spanish speakers are generally grateful if you can speak any of their language at all, and are very tolerant of mistakes in L2 Spanish speakers.

Spanish is considered to be easier to learn for English speakers than many other languages, including German. This is because Spanish sentences follow English sentence structure more than German sentences do. Compared to other Romance languages, Spanish one of the easiest to learn. It is quite a bit easier than French, moderately easier than Literary Portuguese, and somewhat easier than Italian.

Nevertheless, Hispanophones say that few foreigners end up speaking like natives. Part of the reason for this is that Spanish is very idiomatic and the various forms of the subjunctive make for a wide range of nuance in expression. Even native speakers make many mistakes when using the subjunctive in conditional sentences. The dialects do differ quite a bit more than most people say they do. The dialects in Latin America and Spain are quite different, and in Latin America, the Argentine and Dominican dialects are very divergent.

Spanish gets rated 2.5, fairly easy.


Portuguese, like Spanish, is also very easy to learn, though Portuguese pronunciation is harder due to the unusual vowels such as nasal diphthongs and the strange palatal lateral ʎ, which many English speakers will mistake for an l.

Of the nasal diphthongs, ão is the hardest to make. In addition, Brazilian (Br) Portuguese has an r that sounds like an h, and l that sounds like a w and a d that sounds like a j, but only some of the time! Fortunately, in European (Eu) Portuguese, all of these sounds sound as you would expect them to.

Portuguese has two r sounds, a tapped r (ɾ) that is often misconceived as a trilled r (present in some British and Irish English dialects) and an uvular r (ʁ) which is truly difficult to make. However, this is the typical r sound found in French, German, Danish and Hebrew, so if you have a background in one of those languages, this should be an easy sound.  L2 learners not only have a hard time making them but also mix them up sometimes.

You can run many vowels together in Portuguese and still make a coherent sentence. See here:

É o a ou o b? [Euaoube]
Is it (is your answer) a or b?

That utterance turns an entire sentence into a single verb via run-on vowels, five of them in a row.

Most Portuguese speakers say that Portuguese is harder to learn than Spanish, especially the variety spoken in Portugal. Eu Portuguese elides many vowels and has more sounds per symbol than Br Portuguese does. Portuguese has both nasal and oral vowels, while Spanish has only oral values. In addition, Portuguese has 12 vowel phonemes to Spanish’s five.

Portuguese has also retained the archaic subjunctive future which has been lost in many Romance languages.

Try this sentence: When I am President, I will change the law.

In Spanish, one uses the future tense as in English:

Cuando yo soy presidente, voy a cambiar la ley.

In Portuguese, you use the subjunctive future, lost in all modern Romance languages and lacking in English:

Quando eu for presidente, vou mudar a lei. – literally, When I may be President, I will possibly change the law.

The future subjunctive causes a lot of problems for Portuguese learners and is one of the main ways that it is harder than Spanish.

There is a form called the personal infinitive in Eu Portuguese in which the infinitive is actually inflected that also causes a lot of problems for Portuguese learners.

Personal infinitive:

para eu cantar      for me to sing
para tu cantares    for you to sing
para el cantar      for him to sing
para nos cantarmos  for us to sing
para eles cantarem  for them to sing

Some sentences with the personal infinitive:

Ficamos em casa do Joao ao irmos ao Porto.
We are staying at John’s when we go to Porto.

Comprei-te um livro para o leres.
I bought you a book for you to read.

In addition, when making the present perfect in Spanish, it is fairly easy with the use have + participle as in English.

Compare I have worked.

In Spanish:

Yo he trabajado.

In Portuguese, there is no perfect to have nor is there any participle, instead, present perfect is formed via a conjugation that varies among verbs:

Eu trabalhei – because Eu hei trabalhado makes no sense in Portuguese.

Portuguese still uses the pluperfect tense quite a bit, a tense that gone out or is heading out of most IE languages. The pluperfect is used a lot less now in Br Portuguese, but it is still very widely used in Eu Portuguese. The pluperfect is used to discuss a past action that took place before another past action. An English translation might be:

He had already gone by the time she showed up.

The italicized part would be the equivalent to the pluperfect in English.

O pássaro voara quando o gato pulou sobre ele para tentar comê-lo.
The bird had (already) flown away when the cat jumped over it trying to eat it.

Even Br Portuguese has its difficulties centering around diglossia. It is written in 1700’s Eu Portuguese, but in speech, the Brazilian vernacular is used. Hence:

I love you

Amo-te or Amo-o [standard, written]
Eu te amo or Eu amo você  [spoken]

We saw them

Vimo-los [standard, written]
A gente viu eles  [spoken]

Even Eu Portuguese native speakers often make mistakes in Portuguese grammar when speaking. Young people writing today in Portuguese are said to be notorious for not writing or speaking it properly. The pronunciation is so complicated and difficult that even foreigners residing in Portugal for a decade never seem to get it quite right. In addition, Portuguese grammar is unimaginably complicated. There are probably more exceptions than there are rules, and even native speakers have issues with Portuguese grammar.

Portuguese gets a 3 rating, average difficulty.

Eastern Romance

Surprisingly enough, Romanian is said to be one of the harder Romance languages to speak or write properly. Even Romanians often get it wrong. One strange thing about Romanian is that the articles are attached to the noun as suffixes. In all the rest of Romance, articles are free words that precede the noun.

English  telephone the telephone
Romanian telefon   telefonul

Romanian is the only Romance language with case. There are five cases – nominative, accusative, genitive, dative, vocative – but vocative is not often use, and the other four cases combine as two cases: nominative/accusative and dative/genitive merge as single cases.

Nominative-Accusative aeroportul
Genitive-Dative       aeroportului

The genitive is hard for foreigners to learn as is the formation of plurals. The ending changes for no apparent reason when you pluralize a noun and there are also sound changes:

brad (singular)
brazi (plural)

Many native speakers have problems with plurals and some of the declensions. Unlike the rest of Romance which has only two genders, masculine and feminine, Romanian has three genders – masculine, feminine and neuter (the neuter is retained from Latin). However, neuter gender is realized on the surface as masculine in the singular and feminine in the plural, unlike languages such as Russian where neuter gender is an entirely different gender.

The pronunciation is not terribly difficult, but it is hard to learn at first. For some odd reason, the Latinization is considered to be terrible.

Romanian is harder to learn than Spanish or Italian and possibly harder than French. However, you can have odd sentences with nothing but vowels as in Maori.

Aia-i oaia ei, o iau eu?
That’s her sheep, should I take it?

It may have the most difficult grammar in Romance. Romanian has considerable Slavic influence and this will make it harder for the English speaker to learn than other Romance languages.

Romanian gets a 3.5 rating, more than average difficulty.

West Germanic

People often say that English is easy to learn, but that is deceptive. For one thing, English has anywhere from 500,000-1 million words (said to be twice as much as any other language – but there are claims that Dutch and Arabic each have 4 million words), and the number increases by the day. Furthermore, most people don’t understand more than 50,000, and a majority might only understand 30,000 words. Yet your average person only uses 5,000 at most.

Actually, the average American or Brit uses a mere 2,500 words. As we might expect, our cultivated Continentals in Europe, such as Spaniards and French, probably have twice the regular vocabulary of English speakers and far more colloquial expressions.

In addition, verbal phrases or phrasal verbs are a nightmare. Phrasal verbs are probably left over from “separable verbs” in German. In most of the rest of IE, these become affixes as in Latin Latin cum-, ad-, pro-, in-, ex-, etc.. In many cases, phrasal verbs can have more than 10 different antagonistic meanings.

Here is a list of 123 phrasal verbs using the preposition up after a verb:

Back up – to go in reverse, often in a vehicle, or to go back over something previously dealt with that was poorly understood in order to understand it better.
Be up – to be in a waking state after having slept. I’ve been up for three hours. Also to be ready to do something challenging. Are you up for it?
Beat up
– to defeat someone thoroughly in a violent physical fight.
Bid up – to raise the price of something, usually at an auction, by calling out higher and higher bids.
Blow up – to explode an explosive or for a social situation to become violent and volatile.
Bone up – to study hard.
Book up – all of the booking seats have been filled for some entertainment or excursion.
Bottle up – to contain feelings until they are at the point of exploding.
Break up – to break into various pieces, or to end a relationship, either personal or between entitles, also to split a large entity, like a large company or a state.
Bruise up – to receive multiple bruises, often serious ones.
Brush up – to go over a previously learned skill.
Build up – to build intensively in an area, such as a town or city, from a previously less well-developed state.
Burn up – burn completely or to be made very angry.
Bust up – to burst out in laughter.
Buy up – to buy all or most all of something.
Call up – to telephone someone. Or to be ordered to appear in the military. The army called up all males aged 18-21 and ordered them to show up at the nearest recruiting office.
Catch up
– to reach a person or group that one had lagged behind earlier, or to take care of things, often hobbies, that had been put off by lack of time.
Chat up – to talk casually with a goal in mind, usually seduction or at least flirtation.
Cheer up – to change from a downcast mood to a more positive one.
Chop up – to cut into many, often small, pieces.
Clam up – to become very quiet suddenly and not say a thing.
Clean up – to make an area thoroughly tidy or to win completely and thoroughly.
Clear up – for a storm to dissipate, for a rash to go away, for a confusing matter to become understandable.
Close up – to close, also to end business hours for a public business.
Come up – to approach closely, to occur suddenly or to overflow.
Cook up – to prepare a meal or to configure a plan, often of a sly, ingenious or devious nature. They cooked up a scheme to swindle the boss.
Crack up
– to laugh, often heartily or to fall apart emotionally.
Crank up – elevate the volume.
Crawl up – to crawl inside something.
Curl up – to rest in a curled body position, either alone or with another being.
Cut up – to shred or to make jokes, often of a slapstick variety.
Do up – apply makeup to someone, often elaborately.
Dream up – to imagine a creative notion, often an elaborate one.
Dress up – to dress oneself in formal attire.
Drive up – to drive towards something and then stop, or to raise the price of something by buying it intensively.
Drum up – to charge someone with wrongdoing, usually criminal, usually by a state actor, usually for false reasons.
Dry up – to dessicate.
Eat up – implies eating something ravenously or finishing the entire meal without leaving anything left.
End up – to arrive at some destination after a long winding, often convoluted journey either in space or in time.
Face up – to quit avoiding your problems and meet them head on.
Feel up – to grope someone sexually.
Get up – to awaken or rise from a prone position.
Give up – to surrender, in war or a contest, or to stop doing something trying or unpleasant that is yielding poor results, or to die, as in give up the ghost.
Grow up – to attain an age or maturity or to act like a mature person, often imperative.
Hang up – to place on a hanger or a wall, to end a phone call.
Hike up – to pull your clothes up when they are drifting down on your body.
Hit up – to visit someone casually or to ask for a favor or gift, usually small amounts of money.
Hold up – to delay, to ask someone ahead of you to wait, often imperative. Also a robbery, usually with a gun and a masked robber.
Hook up – to have a casual sexual encounter or to meet casually for a social encounter, often in a public place; also to connect together a mechanical devise or plug something in.
Hurry up – imperative, usually an order to quit delaying and join the general group or another person in some activity, often when they are leaving to go to another place.
Keep up – to maintain on a par with the competition without falling behind.
Kiss up – to mend a relationship after a fight.
Knock up – to impregnate.
Lay up – to be sidelined due to illness or injury for a time.
Let up – to ease off of someone or something, for a storm to dissipate, to stop attacking someone or s.t.
Lick up – to consume all of a liquid.
Light up – to set s.t. on fire or to smile suddenly and broadly.
Lighten up – to reduce the downcast or hostile seriousness of the mood of a person or setting.
Listen up – imperative – to order someone to pay attention, often with threats of aggression if they don’t comply.
Live up – to enjoy life.
Lock up – to lock securely, often locking various locks, or to imprison, or for an object or computer program to be frozen or jammed and unable to function.
Look up – to search for an item of information in some sort of a database, such as a phone book or dictionary. Also to admire someone.
Make up – to make amends, to apply cosmetics to one’s face or to invent a story.
Man up – to elevate oneself to manly behaviors when one is slacking and behaving in an unmanly fashion.
Mark up – to raise the price of s.t.
Measure up – in a competition, for an entity to match the competition.
Meet up – to meet someone or a group for a get meeting or date of some sort.
Mess up – to fail or to confuse and disarrange s.t. so much that it is bad need or reparation.
Mix up – to confuse, or to disarrange contents in a scattered fashion so that it does not resemble the original.
Mop up – mop a floor or finish off the remains of an enemy army or finalize a military operation.
Move up – to elevate the status of a person or entity in competition with other entities- to move up in the world.
Open up – when a person has been silent about something for a long time, as if holding a secret, finally reveals the secret and begins talking.
Own up – to confess to one’s sins under pressure and reluctantly.
Pass up – to miss an opportunity, often a good one.
Patch up – to put together a broken thing or relationship.
Pay up – to pay, usually a debt, often imperative to demand payment of a debt, to pay all of what one owes so you don’t owe anymore.
Pick up – to grasp an object and lift it higher, to seduce someone sexually or to acquire a new skill, usually rapidly.
Play up – to dramatize.
Pop up – for s.t. to appear suddenly, often out of nowhere.
Put up – to hang, to tolerate, often grudgingly, or to put forward a new image.
Read up – to read intensively as in studying.
Rev up – to turn the RPM’s higher on a stationary engine.
Ring up – to telephone someone or to charge someone on a cash register.
Rise up – for an oppressed group to arouse and fight back against their oppressors.
Roll up – to roll s.t. into a ball, to drive up to someone in a vehicle or to arrest all the members of an illegal group. The police rolled up that Mafia cell quickly.
Run up
– to tally a big bill, often foolishly or approach s.t. quickly.
Shake up – to upset a paradigm, to upset emotionally.
Shape up – usually imperative command ordering someone who is disorganized or slovenly to live life in a more orderly and proper fashion.
Shoot up – to inject, usually illegal drugs, or to fire many projectiles into a place with a gun.
Show up – to appear somewhere, often unexpectedly.
Shut up – to silence, often imperative, fighting words.
Sit up – to sit upright.
Slip up – to fail.
Speak up – to begin speaking after listening for a while, often imperative, a request for a silent person to say what they wish to say.
Spit up – to vomit, usually describing a child vomiting up its food.
Stand up – to go from a sitting position to a standing one quickly.
Start up – to initialize an engine or a program, to open a new business to go back to something that had been terminated previously, often a fight; a recrudescence.
Stay up – to not go to bed.
Stick up – to rob someone, usually a street robbery with a weapon, generally a gun.
Stir up – stir rapidly, upset a calm surrounding or scene or upset a paradigm.
Stop up – to block the flow of liquids with some object(s).
Straighten up – to go from living a dissolute or criminal life to a clean, law abiding one.
Suck up – to ingratiate oneself, often in an obsequious fashion.
Suit up – to get dressed in a uniform, often for athletics.
Sweep up – to arrest all the members of an illegal group, often a criminal gang.
Take up – to cohabit with someone – She has taken up with him. Or to develop a new skill, to bring something to a higher elevation, to cook something at a high heat to where it is assimilated.
Talk up – to try to convince someone of something by discussing it dramatically and intensively.
Tear up – to shred.
Think up – to conjure up a plan, often an elaborate or creative one.
Throw up – to vomit.
Touch up – to apply the final aspects of a work nearly finished.
Trip up – to stumble mentally over s.t. confusing.
Turn up – to increase volume or to appear suddenly somewhere.
Vacuum up – to vacuum.
Use up – to finish s.t. completely so there is no more left.
Wait up – to ask other parties to wait for someone who is coming in a hurry.
Wake up – to awaken.
Walk up – to approach someone or something.
Wash up – to wash.
Whip up – to cook a meal quickly or for winds to blow wildly.
Work up – to exercise heavily, until you sweat to work up a sweat. Or to generate s.t. a report or s.t. of that nature done rather hurriedly in a seat of the pants and unplanned fashion. We quickly worked up a formula for dealing with the matter.
Wrap up
– To finish something up, often something that is taking too long. Come on, let us wrap this up and getting it over with. Also, to bring to a conclusion that ties the ends together. The story wraps up with a scene where they all get together and sing a song.
Write up
– often to write a report of reprimand or a violation. The officer wrote him for having no tail lights.

Here are  phrasal verbs using the preposition down:

Back down – to retreat from a challenge or a threat.
Be down  – to be ready to ready to do something daring, often s.t. bad, illegal or dangerous, such as a fight or a crime. Are you down?
Blow down – to knock something down via a strong wind.
Break down – to take anything apart in order to reveal its component parts.
Burn down
– reduce s.t. to ashes, like a structure.
Chop down – to fell a tree with an ax.
Clamp down – to harshly police something bad in order to reduce its incidence, especially s.t. that had been ignored in the past.
Climb down – to retract a poorly made statement.
Cook down – to reduce the liquid content in a cooked item.
Crack down – To police harshly against people doing bad things.
Cut down – to fell a tree by any means or to reduce the incidence of anything, especially something bad.
Drink down – to consume all of s.t.
Drive down – to harshly bring down the price of something, often through brutal means. Investors drove down the price of the stock after the company’s latest product failed badly.
Dress down – to deliberately dress more poorly than expected, often as a trendy fashion statement.
Get down – to have fun and party, or to lie prone and remain there or to reduce something to bare essentials. Get down on the floor or Getting down to brass tacks, how can we possibly explain this anomaly other than in this particular manner?
Hang down – to let one’s hair fall down in front of one’s eyes or to hang s.t. often a banner, from a building or structure.
Hike down – to lower one’s pants. The gangsters hike their pants down to look tough.
Hold down – to hold someone or s.t. on the floor so they cannot rise or get up.
Keep down – to prevent a group, often a repressed group, from achieving via oppression by a ruling group. The Whites are keeping us Black people down.
Kick down – Drug slang meaning to contribute your drugs to a group drug stash so others can consume them with you, to share your drugs with others. Often used in a challenging sense.
Knock down – to hit or strike something so hard that it falls to the ground or collapses.
Let down – to be discouraged by something one had high hopes for.
Live down – to recover from a humiliating experience. After he was publicly humiliated, he was never able to live down his rejection by the people.
Look down – to regard someone in a negative or condemnatory way from a the point of a superior person.
Mark down – to discount the price of s.t., often significantly.
Party down – to have fun and party
Pass down – to leave s.t. of value to someone as an inheritance after a death or to inherit a saying or custom via one’s ancestors through time. It was passed down through the generations.
Pat down – to frisk.
Pay down – to reduce a bill, often a large bill, by making payments, often significant payments. We are slowly paying down that bill.
Play down – to reduce the significance of s.t. often s.t. negative, often in order to deceive people into thinking s.t. is better than it really is.
Put down – to criticize someone in a condescending way as a superior person, to insult.
Play down – to deemphasize.
Rip down – to tear s.t. off of a wall such as a sheet or poster.
Run down – to run over something or someone with a vehicle, to review a list or to attack someone verbally for a long time.
Set down – to postulate a set of rules for something.
Shake down – to rob someone purely through the use of verbal or nonphysical force or power.
Shoot down – to shoot at a flying object like a plane, hitting it so it crashes to the ground or to reject harshly a proposal.
Shut down – to close operations of an entity.
Speak down – to talk to someone in a condescending way from the point of view of a superior person.
Take down – to demolish s.t. like a building, to tackle someone, or to raid and arrest many members of an illegal organization.
Talk down – to speak to someone in an insulting manner as if one was superior or to mollify a very angry person to keep them from causing future damage. The police were able to talk down the shooter until he laid down his fun and set the hostages free.
Tear down – to demolish or destroy someone verbally or to destroy s.t. by mechanical means.
Throw down – to throw money or tokens into the pile in the center when gambling.
Turn down – to reduce the volume of something or to reject an offer.
Write down – to write on a sheet of paper

There are figures of speech and idioms everywhere (some estimate that up to 20% of casual English speech is idiomatic), and it seems impossible to learn them all. In fact, few second language learners get all the idioms down pat.

The spelling is insane and hardly follows any rules at all. The English spelling system in some ways is frozen at about the year 1500 or so. The pronunciation has changed but the spelling has not. Careful studies have shown that English-speaking children take longer to read than children speaking other languages (Finnish, Greek and various Romance and other Germanic languages) due to the difficulty of the spelling system. Romance languages were easier to read than Germanic ones.

This may be why English speakers are more likely to be diagnosed dyslexic than speakers of other languages. The dyslexia still exists if you speak a language with good sound-symbol correspondence, but it’s covered up so much by the ease of the orthography that it seems invisible, and the person can often function well. But for a dyslexic, trying to read English is like walking into a minefield.

Letters can make many different sounds, a consequence of the insane spelling system. A single sound can be spelled in many different ways: e can be spelled e, ea, ee, ei, eo, ey, ae, i, ie, and y. The k sound can spelled as c, cc, ch, ck, k, x, and q.

The rules governing the use of the indefinite, definite and zero article are opaque and possibly don’t even exist. There are synonyms for almost every word in a sentence, and the various shades of meaning can be difficult to discern. In addition, quite a few words have many different meanings. There are strange situations like read and read, which are pronounced differently and mean two different things.

English word derivation is difficult to get your mind around because of the dual origins of the English language in both Latin/French and German.

See and hear and perceptible and audible mean the same thing, but the first pair is derived from German, and the second pair is derived from Latin.

English word derivation is irregular due for the same reason:

assumeassumption (Latin)
childchildish (German)
buildbuilding (German)

In English we have at least 12 roots with the idea of two in them:


However, English regular verbs generally have only a few forms in their normal paradigm. In this arrangement, there are only five forms of the verb in general use for the overwhelming majority of verbs:

present except 3rd singular  steal
3rd person singular          steals
progressive                  stealing
past                         stole
perfect                      stolen

Even a language like Spanish has many more basic forms than that. However, coming from an inflected language, the marking of only the 3rd singular and not marking anything else may seem odd.

The complicated part of English verbs is not their inflection – minimal as it is – but instead lies in the large number of irregular verbs.

There is also the oddity of the 2nd person being the same in both the singular and the plural – you. Some dialects such as US Southern English do mark the plural – you all or y’all.

English prepositions are notoriously hard, and few second language learners get them down right because they seem to obey no discernible rules.

One problem that English learners complain of is differential uses of have.

  1. Perfect tense. I have done it.
  2. Deontic (must). I have to do it.
  3. Causative. I had it done.

While English seems simple at first – past tense is easy, there is little or no case, no grammatical gender, little mood, etc., that can be quite deceptive. In European countries like Croatia, it’s hard to find a person who speaks English with even close to native speaker competence.

There are quite a few English dialects – over 100 have been recorded in London alone.

The problem with English is that it’s a mess! There are languages with very easy grammatical rules like Indonesian and languages with very hard grammatical rules like Arabic. English is one of those languages that is simply chaotic. There are rules, but there are exceptions everywhere and exceptions to the exceptions. Grammatically, it’s disaster area. It’s hard to know where to start.

However, it is often said that English has no grammatical rules. Even native speakers make this comment because that is how English seems due to its highly irregular nature. Most English native speakers, even highly educated ones, can’t name one English grammatical rule. Just to show you that English does have rules though, I will list some of them.

*Indicates an ungrammatical form.

Adjectives appear before the noun in noun phrases:

Small dogs barked.
*Dogs small barked.

Adjectives are numerically invariant:

the small dog
the small dogs
The dog is small.
The dogs are small.

Intensifiers appear before both attributive and predicative adjectives:

The very small dog barked.
*The small very dog barked.

The dog was very small.
*The dog was small very.

Attributive adjectives can have complements:

The dog was scared.
The dog was scared of cats.

But predicative adjectives cannot:

The scared dog barked.
*The scared of cats dog barked.

Articles, quantifiers, etc. appear before the adjective (and any intensifier) in a noun phrase:

The very small dog barked.
*Very the small dog barked.
*Very small the dog barked.

Every very small dog barked.
*Very every small dog barked.
*Very small every dog barked.

Relative clauses appear after the noun in a noun phrase:

The dog that barked.
*The that barked dog.

The progressive verb form is the bare form with the suffix -ing, even for the most irregular verbs in the language:



The infinitive verb form is to followed by the bare form, even for the most irregular verbs in the language:

to be
to have
to do

*to was
*to are
*to am.

The imperative verb form is the bare form, even for the most irregular verb in the language:



All 1st person present, 2nd person present, and plural present verb forms are equivalent to the bare form, except for to be.

All past tense verb forms of a given verb are the same regardless of person and number, except for to be.

Question inversion is optional:

You are leaving?
Are you leaving?

But when inversion does occur in a wh-question, a wh-phrase is required to be fronted:

You’re seeing what?
What are you seeing?

*Are you seeing what?

Wh-fronting is required to affect an entire noun phrase, not just the wh-word:

You are going to which Italian restaurant?
Which Italian restaurant are you going to?

*Which are you going to Italian restaurant?
*Which Italian are you going to restaurant?
*Which restaurant are you going to Italian?

Wh-fronting only happens once, never more:

What are you buying from which store
Which store are you buying what from?

*What which store are you buying from?
*Which store what are you buying from?

The choice of auxiliary verb in compound past sentences does not depend on the choice of main verb:

I have eaten.
I have arrived.

*I am eaten.
*I am arrived.

cf. French

J’ai mangé.
Je suis arrivé.

English can be seen as an inverted pyramid in terms of ease of learning. The basics are easy, but it gets a lot more difficult as you progress in your learning.

While it is relatively easy to speak it well enough to be more or less understandable most of the time, speaking it correctly is often not possible for a foreigner even after 20 years of regular use.

English only gets a 2.5 rating , somewhat difficult.

High German

German’s status is controversial. It’s long been considered hard to learn, but many learn it fairly easily.

Pronunciation is straightforward, but there are some problems with the müde, the Ach, and the two ch sounds in Geschichte. Although the first one is really an sch instead of a ch, English speakers lack an sch, so they will just see that as a ch. Further, there are specific rules about when to use the ss (or sz as Germans say) or hard s. The r in German is a quite strange ʁ, and of common languages, only French has a similar r. The çχ and ‘ü sounds can be hard to make. Consonant clusters like Herkunftswörterbuch or Herbstpflanze can be be difficult. German permits the hard to pronounce shp and shtr consonant clusters. Of the vowels, ö and ü seem to cause the most problems.

German grammar is quite complex. It recently scored as one of the weirdest languages in Europe on a study, and it also makes it onto worst grammars lists. The main problem is that everything is irregular. Nouns, plurals, determiners, adjectives, superlatives, verbs, participles – they are all irregular. It seems that everything in the language is irregular.

There are six different forms of the depending on the noun case:


but 16 different slots to put the six forms in, and the gender system is irrational. In a more basic sense and similar to Danish, there are three basic forms of the:


Each one goes with a particular noun, and it’s not very clear what the rules are.

One problem with German syntax is that the verb, verbs or parts of verbs doesn’t occur until the end of the sentence. This sentence structure is known as V2 syntax, and it is quite alien for English speakers. There are verbal prefixes, and they can be modified in all sorts of ways that change meanings in a subtle manner. There are dozens of different declension types for verbs, similar to Russian and Irish. There are also quite a few irregular verbs that do not fit into any of the paradigms.

German also has Schachtelsätze, box clauses, which are like clauses piled into other clauses. In addition, subclauses use SOV word order. Whereas in Romance languages you can often throw words together into a sentence and still be understood if not grammatical, in German, you must learn the sentence structure – it is mandatory and there is no way around it. The syntax is very rigid but at least very regular.

German case is also quite regular. The case exceptions can be almost counted on one hand. However, look at the verb:


in which the direct object is in dative rather than the expected absolutive.

An example of German case (and case in general) is here:

The leader of the group gives the boy a dog.

In German, the sentence is case marked with the four different German cases:

Der Führer (nominative)
der Gruppe
gibt dem Jungen (dative)
einen Hund (accusative).

There are three genders, masculine, feminine and neutral. Yet it is difficult to tell which gender any particular noun is based on looking at it, for instance, petticoat is masculine! Any given noun inflects via the four cases and the three genders. Furthermore, the genders change between masculine and feminine in the same noun for no logical reason. Gender seems to be one of the main problems that German learners have with the language. Figuring out which word gets which gender must simply be memorized as there are no good clues.

Phonology also changes strangely as the number of the noun changes:

Haushouse (singular)
Haeuserhouses (plural with umlaut)

But to change the noun to a diminutive, you add -chen:

Haueschen – little house (singular, yet has the umlaut of the plural)

This is part of a general pattern in Germanic languages of roots changing the vowel as verbs, adjectives and nouns with common roots change from one into the other. For instance, in English we have the following vowel changes in these transformed roots:

foul filth
tell tale
long length
full fill
hot  heat
do   does

Much of this has gone out of English, but it is still very common in German. Dutch is in between English and German.


For sick, we have:

krank      sick
kränker    sicker
kränklich  sickly
krankhaft  pathological
kranken an to suffer from
kränken    to hurt
kränkeln   to be ailing
erkranken  to fall ill

For good, we have:

gut     good
Güte    goodness
Gut     a good
Güter   goods
gütig   kind
gütlich amicable

German also has a complicated preposition system.

German also has a vast vocabulary, the fourth largest in the world. This is either positive or negative depending on your viewpoint. Language learners often complain about learning languages with huge vocabularies, but as a native English speaker, I’m happy to speak a language with a million words. There’s a word for just about everything you want to say about anything, and then some!

On the plus side, word formation is quite regular.

Pollution is Umweltverschmutzung. It consists, logically, of two words, Umwelt and Verschmutzung, which mean environment and dirtying.

In English, you have three words, environment, dirtying and pollution, the third one, the combination of the first two, has no relation to its semantic roots in the first two words.

Nevertheless, this has its problems, since it’s not simple to figure out how the words are stuck together into bigger words, and meanings of morphemes can take years to figure out.

German has phrasal verbs as in English, but the meaning is often somewhat clear if you take the morphemes apart and look at their literal meanings. For instance:

vorschlagento suggest parses out to er schlägt vorto hit forth

whereas in English you have phrasal verbs like to get over with which even when separated out, don’t make sense literally.

German, like French and Italian, has two auxiliary tenses – habe and bin. However, their use is quite predictable and the tenses are not inflected so the dual auxiliary is easier in German than in French or especially Italian.

Reading German is actually much easier than speaking it, since to speak it correctly, you need to memorize not only genders but also adjectives and articles.

German is not very inflected, and the inflection that it does take is more regular than many other languages. Furthermore, German orthography is phonetic, and there are no silent letters.

German, like Dutch, is being flooded with English loans. While this helpful to the English speaker, others worry that the language is at risk of turning into English.

Learning German can be seen as a pyramid. It is very difficult to grasp the basics, but once you do that, it gets increasingly easy as the language follows relatively simple rules and many words are created from other words via compound words, prefixes and suffixes.

Rating German is hard to do. It doesn’t seem to deserve to a very high rating, but it makes a lot of people’s “hardest language you ever tried to learn” list for various reasons.

German gets a 3.5 rating, moderately difficult.

Low Franconian

While Dutch syntax is no more difficult than English syntax, Dutch is still harder to learn than English due to the large number of rules used in both speaking and writing. The Dutch say that few foreigners learn to speak Dutch well. Part of the problem is that some words have no meaning at all in isolation (meaning is only derived via a phrase or sentence). Word order is somewhat difficult because it is quite rigid. In particular, there are complex and very strange rules about the order of verbs in verbal clusters. It helps if you know German as the rule order is similar, but Dutch word order is harder than German word order. Foreigners often seem to get the relatively lax Dutch rules about word order wrong in long sentences.

Verbs can be difficult. For instance, there are no verbs get and move. Instead, get and move each have about a dozen different verbs in Dutch. A regular Dutch verb has six different forms.

Dutch spelling is difficult, and most Dutch people cannot even spell Dutch correctly. There are only two genders – common and neuter – as opposed to three in German – feminine, neuter and masculine. In Dutch, the masculine and feminine merged in the common gender. But most Dutch speakers cannot tell you the gender of any individual word, in part because there are few if any clues to the gender of any given noun.

There are remnants of the three gender system in that the Dutch still use masculine/feminine for some nouns. In the Netherlands now, most Dutch speakers are simply using masculine (common) for most nouns other than things that are obviously feminine like the words mother and sister.

However, in Belgium, where people speak Flemish, not Dutch, most people still know the genders of words. Not only that but the 3-gender system with masculine, feminine and neuter remains in place in Flemish. In addition, in Flemish, the definite article still makes an obvious distinction between masculine and feminine, so it is easy to figure out the gender of a noun:

ne man, nen boom, nen ezel, nen banaan (masculine)
een vrouw, een koe, een wolk, een peer (feminine)

In addition, most Dutch speakers cannot tell you what pronoun to use in the 3rd person singular when conjugating a verb.

This is because there are two different systems in use for conjugating the 3sing.

The basic paradigm is:

hij      he
zij (ze) she
het      it

System 1
male persons    hij
female persons  zij
neuter words    het
animals         hij, unless noun = neuter
objects         hij, "       "
abstractions    zij, "       "
substances      hij, "       "

System 2
male persons      hij
female persons    zij
all animals       hij
all objects       hij
all abstractions  zij
all substances    het

For instance, melk is a common noun. Under system 1, it would be hij. But under system 2, it would be het because it is a substance.

The er word is tricky in Dutch. Sometimes it is translated as English there, but more often then not it is simply not translated in English translations because there is no good translation for it. There are two definite articles, de and het, and they are easily confused.

Dutch has something called modal particles, the meanings of which are quite obscure.

Some say Dutch is irregular, but the truth is that more than Dutch has a multitude of very complex rules, rules that are so complicated that is hard to even figure them out, much less understand them. Nevertheless, Dutch has 200 irregular verbs.

In some respects, Dutch is a more difficult language than English. For instance, in English, one can simply say:

The tree is in the garden.

But in Dutch (and also in German) you can’t say that. You have to be more specific. What is the tree doing in the garden? Is it standing there? Is it lying on the grass? You have to say not only that the tree is in the garden, but what it is doing there.

In Dutch, you need to say:

Daar ligt een boom in de tuin.
The tree is standing in the garden.

Daar ligt een boom in de tuin.
The tree is lying in the garden.

Dutch pronunciation is pretty easy, but the ui, euij, au, ou, eeuw and uu sounds can be hard to make. Dutch speakers say only Germans learn to pronounce the ui correctly.

Dutch was listed as one of the top weirdest languages in Europe in a recent study.

Dutch is almost being buried in a flood of English loans. While this helps the English speaker, others worry that the Dutch nature of the language is at risk.

Dutch seems to be easier to learn than German. Dutch has fewer cases, thus fewer articles and and adjective endings. There are two main ways of pluralizing in Dutch: adding -‘s and adding -en. Unfortunately, in German, things are much more complex than that. Dutch has only two genders (and maybe just a trace of a third) but German definitely has three genders. Verb conjugation is quite similar in both languages, but it is a bit easier in Dutch. Word order is the same: complex in both languages. Both languages are equally complex in terms of pronunciation. Both have the difficult ø and y vowels.

Dutch gets a 3 rating, average difficulty.

Afrikaans is just Dutch simplified.

Where Dutch has 200 irregular verbs, Afrikaans has only six. A Dutch verb has six different forms, but Afrikaans has only two. Afrikaans has two fewer tense than Dutch. Dutch has two genders, and Afrikaans has only one. Surely Afrikaans ought to be easier to learn than Dutch.

Afrikaans gets a 2 rating, very easy to learn.

North Germanic
West Scandinavian

Icelandic is very hard to learn, much harder than Norwegian, German or Swedish. Part of the problem is pronunciation. The grammar is harder than German grammar, and there are almost no Latin-based words in it. The vocabulary is quite archaic. Modern loans are typically translated into Icelandic equivalents rather than borrowed fully into Icelandic.

There are four cases: nominative, accusative, dative and genitive – as in German – and there are many exceptions to the case rules, or “quirky case,” as it is called. In quirky case, case can be marked on verbs, prepositions and and adjectives. The noun morphology system is highly irregular. Articles can be postfixed and inflected and added to the noun. In fact, Icelandic in general is highly irregular, not just the nouns.

Verbs are modified for tense, mood, person and number, as in many other IE languages (this is almost gone from English). There are up to ten tenses, but most of these are formed with auxiliaries as in English. Icelandic also modifies verbs for voice – active, passive and medial. Furthermore, there are four different kinds of verbs – strong, weak, reduplicating and irregular, with several conjugation categories in each division.  Many verbs just have to be memorized.

Adjectives decline in an astounding 130 different ways, but many of these forms are the same.

The language is generally SVO, but since there is so much case-marking, in poetry all possibilities – SVO, SOV, VSO, VOS, OSV and OVS – are allowed. There is also something odd called “long distance reflexives,” which I do not understand.

In addition, Icelandic has the typical Scandinavian problem of a nutty orthography.

Icelandic verbs are very regular but the sounds change so much, especially the vowels, that the whole situation gets confusing pretty fast. In addition, there are three different verbal paradigms depending on the ending of the verb:


Icelandic verbs are commonly cited as some of the hardest verb systems around, at least in Europe. Even Icelandic people say their own verbs are difficult.

Icelandic has a voiceless lateral l. This can be a hard sound to make for many learners, especially in the middle of a word. In addition, there are two alveolar trills (the rolled r sound in Spanish), and one of them is voiced while the other is voiceless. Learners say they have problems with both of these sounds. In addition to voiceless l‘s and r‘s, Icelandic also has four voiceless nasals – , , ɲ̊, and ŋ̊ – the n, m, ny (as in Spanish nina), and ng sounds.

There are also contrasts between aspirated and nonaspirated stops including the odd palatal stops and c. In addition, there is a strange voiceless palatal fricative ç (similar to the h in English huge). In addition, Icelandic has a hard to pronounce four consonant cluster strj- that occurs at the beginning of a word.

Icelandic does have the advantage of being one of the few major languages with no significant dialects, so this is a plus. Icelandic has been separated from the rest of Scandinavian for 1,100 years. Icelandic is spoken over a significant region, much of which has inhabited places separated by large expanses of uninhabitable land such as impassable glaciers, volcanoes, lava flows,  geysers and almost no food. How Icelandic managed to not develop dialects in this situation is mysterious.

Icelandic has traditionally been considered to be one of the hardest languages on Earth to learn.

Icelandic gets a 5 rating, extremely difficult to learn.

Faroese is said to be even harder to learn than Icelandic, with some very strange vowels not found in other North Germanic languages.

Faroese has strong, weak and irregular verbs. It also has a strange supine tense.

The Faroese orthography is as irrational as Icelandic’s. There are so many rules to learn to be able to write Faroese properly. Faroese, like Icelandic, prefers to coin new words rather than borrow words wholesale into its language. Therefore the English speaker will not see a lot of obvious borrowings to help them out. Some argue against this nativization process, but maybe it is better than being buried in English loans like German and Dutch are at the moment.

computertelda (derived from at telja – to count. Icelandic has a similar term.
helicoptertyrla (derived from tyril – a spinning tool for making wool or loom.
pocket calculator
telduhvølpur (Lit. computer puppy), roknimaskina (Lit. calculating machine)

Faroese has the advantage of having no verbal aspect, and verbal declension does not differ much according to person. However, Faroese has a case system like Icelandic.

Faroese gets a 5 rating,extremely difficult.

Norwegian is fairly easy to learn, and Norwegian is sometimes touted as the easiest language on Earth to learn for an English speaker.

This is confusing because Danish is described below as a more difficult language to learn, and critics say that Danish and Norwegian are the same, so they should have equal difficulty. But only one Norwegian writing system is almost the same as Danish the Danish writing system. Danish pronunciation is quite a bit different from Norwegian, and this is where the problems come in.

Even Norwegian dialects can be a problem. Foreigners get off the plane having learned a bit of Norwegian and are immediately struck by the strangeness of the multiplicity of dialects, which for the most part are easy for Norwegians to understand but can be hard for foreigners. Norwegians often only understand their many dialects due to bilingual learning and much exposure and there are definitely Norwegian dialects that even Norwegians have a hard time understand like Upper and Lower Sogn and Trondnersk.

There is also the problematic en and et alternation, as discussed with Danish. Norwegian has an irrational orthographic system, like Swedish, with silent letters and many insensible sounds, both consonants and vowels. It has gone a long time without a spelling reform. It has the additional orthographic issues of two different writing systems and a multitude of dialects. Norwegian, like Danish and Swedish, has a huge vowel inventory, one of the larger ones on Earth. It can be confusing and difficult to make all of those odd vowel sounds: 18 contrasting simple vowels, nine long and nine short , , ɛː, ɑː, , , ʉ̟ː, , øː, ɪ, ɛ, a, ɔ, ʊ, ɵ, ʏ and œ.

Norwegian has very little inflection in its words, but the syntax is very difficult. Norwegian also has “tonemes” which distinguish between homophones.

tankenthe tank
the thought

have two different meanings, even though the stress and pronunciation are the same. The words are distinguished by a toneme.

For some reason, Norwegian scored very high on a study of weirdest languages on Earth, but Swedish and Danish also got high scores.

However, Norwegian is a very regular language.

Norwegian gets a 2 rating, moderately easy to learn.

East Scandinavian

Danish is a harder language to learn than one might think. It’s not hard to read or even write, but it’s quite hard to speak. However, like English, Danish has a non-phonetic orthography, so this can be problematic. It has gone a long time without a spelling reform, so there are many silent letters and sounds, both vowels and consonants, that make no sense. Danish makes it on lists of most irrational orthographies of all.

In addition, there are d words where the d is silent and other d words where it is pronounced, and though the rules are straightforward, it’s often hard for foreigners to get the hang of this. The d in hund is silent, for instance. In addition, the b, d, and g sounds are somehow voiceless in many environments. There are also the strange labiodental glide and alveopalatal fricative sounds. In certain environments, d, g, v, and r turn into vowels.

There are three strange vowels that are not in English, represented by the letters æ, ø and å. They are all present in other Scandinavian languages – æ is present in Icelandic and Norwegian, ø is part of Norwegian, and å is part of Norwegian and Swedish, but English speakers will have problems with them. In addition, Danish has creaky-voiced vowels, which is very strange for an IE language. Danish language learners often report having a hard time pronouncing Danish vowels or even telling one apart from the other. Danish makes it onto lists of the wildest phonologies on Earth,and it made it high on a list of weirdest languages on Earth.

One advantage of all of the Scandinavian languages is that their basic vocabulary (the vocabulary needed to converse at a basic level and be understood) is fairly limited. In other words, without learning a huge number of words, it is possible to have a basic conversation in these languages. This is in contrast to Chinese, where you have to learn a lot of vocabulary just to converse at a basic level.

As with Maltese and Gaelic, there is little correlation between how a Danish word is written and how it is pronounced.

Pronunciation of Danish is difficult. Speech is very fast and comes out in a continuous stream that elides entire words. Vowels in the middle and at the end of words are seldom expressed. There are nine vowel characters, and each one can be pronounced in five or six different ways. There is nearly a full diphthong set, and somehow pharyngealization is used as an accent. Danish has a huge set of vowels, one of the largest sets on Earth. The sheer number of vowels is one reason that Danish is so hard to pronounce. Danish has 32 vowels, 15 short, 13 long and four unstressed: ɑ, ɑː, a, æ, æː, ɛ, ɛː, e, e̝ː, i, , o, , ɔ, ɔː, u, , ø, øː, œ, œː, ɶ, ɶː, y, , ʌ, ɒ, ɒː, ə, ɐ, ɪ, and ʊ.

There is also a strange phonetic element called a stød, which is a very short pause slightly before the vowel(s) in a word. This element is very hard for foreigners to get right.

Just about any word has at least four meanings, and can serve as noun, verb, adjective or adverb. Danish has two genders (feminine and masculine have merged into common gender), and whether a noun is common or neuter is almost impossible to predict and simply must be memorized.

Suggesting that Danish may be harder to learn than Swedish or Norwegian, it’s said that Danish children speak later than Swedish or Norwegian children. One study comparing Danish children to Croatian tots found that the Croat children had learned over twice as many words by 15 months as the Danes. According to the study:

The University of Southern Denmark study shows that at 15 months, the average Danish toddler has mastered just 80 words, whereas a Croatian tot of the same age has a vocabulary of up to 200 terms.

[…] According to the study, the primary reason Danish children lag behind in language comprehension is because single words are difficult to extract from Danish’s slurring together of words in sentences. Danish is also one of the languages with the most vowel sounds, which leads to a ‘mushier’ pronunciation of words in everyday conversation.

Danish gets a 3 rating, average difficulty.

Swedish has the disadvantage of having hundreds of irregular verbs. Swedish also has some difficult phonemes, especially vowels, since Swedish has nine vowels, not including diphthongs. Pronunciation of the ö and å (and sometimes ä, which has a different sound) can be difficult. Swedish also has pitch accent. Pronunciation is probably the hardest part of Swedish.

Words can take either an -en or an –ett ending, and there don’t seem to be any rules about which one to use. The same word can have a number of different meanings.

Swedish, like German, has gender, but Swedish gender is quite predictable by looking at the word, unlike German, where deciding which of the three genders to use seems like a spin of the Roulette wheel.

Word order is comparatively free in that one can write a single sentence multiple ways while changing the meaning somewhat. So I didn’t know that. can be written the following ways:

Det visste jag inte.
Det visste inte jag.
Jag visste inte det.
Jag visste det inte.
Inte visste jag det.

For some reason, Swedish got a very high score on a study of the weirdest languages on Earth.

The different ways of writing that sentence depend on context. In particular, the meaning varies in terms of topic and focus.

There is a 3-way contrast in deixis:

den här
den där

Swedish also has the same problematic phrasal verbs that English does:

att slå -  beat/hit

slå av     turn off
slå fast   settle/establish
slå igen   close/shut
slå igenom become known/be a success
slå in     wrap in, come true
slå ner    beat down
slå på     turn on
slå runt   overturn
slå till   hit/strike/slap, strike a deal
slå upp    open (a book), look s.t. up

Swedish orthography is difficult in learning how to write it, since the spelling seems illogical, like in English. The sj sound in particular can be spelled many different ways. However, Swedish spelling is probably easier than English since Swedish lacks a phonemic schwa, and schwa is the source of many of the problems in English. Where allophonic schwa does appear, it seems to be predictable.

One nice thing about Swedish grammar is that it is similar to English grammar in many ways.

Swedish can be compared to a tube in terms of ease of learning. The basics are harder to learn than in English, but instead of getting more difficult as one progresses as in English, the difficulty of Swedish stays more or less the same from basics to the most complicated. But learning to speak Swedish is easy enough compared to other languages.

Swedish gets a 2.5 rating, easy to average difficulty.


Any Gaelic language is tough. Celtic languages are harder to learn than German or Russian.

Insular Celtic

Old Irish was the version of Irish written from 650 to 900 AD. It was used only by the educated and aristocratic elites. The rest of the population spoke a simplified version that was already on its way to becoming Middle Irish.

The verbal system in Old Irish was one of most complicated of all of the classical languages.

The persons were 1st, 2nd, 3rd and plural. The tenses were present, preterite, imperfect, perfect, future and an odd tense called secondary future. There were imperative and subjunctive moods. There was no infinitive – instead it was formed rather erratically as a verbal noun derived from the verb. This gerund underwent 10 different declensions and often looked little like the verb it is derived from.

cingidto step -> céimstepping

There were both strong and weak verbs, and each had both simple and compound forms.

Bizarrely, every verb had not one but two different paradigms – the conjunct and the absolute. You used the conjunct when the verb is preceded by a conjunct particle such as (not) or in (the question particle). You used the absolute when there was no conjunct particle in front of the verb.

Hence, the present indicative of glenaid (sticks fast), is:

Absolute   Conjunct

glenaim    :glenaim
glenai     :glenai
glenaid    :glen
glenmai    :glenam
glenthae   :glenaid
glenait    :glenat

The colon before the conjunct verbs indicates that a conjunct particle preceded the verb.

The phonological changes were some of the most complicated you could imagine. An attempt was made to orthographically portray all of these convoluted changes, but the orthography ended up a total mess.

Each consonant had four different values depending on where it was in the word and whether or not it was palatal. Hence, even though the 1st person absolute and conjunct look identical above (both are spelled glenaim), they were pronounced differently. The absolute was pronounced glyenum, and the conjunct was pronounced glyenuv.

The grammar was unbelievably complex, probably harder than Ancient Greek. There was even a non-IE substratum running underneath the language.

Old Irish gets a 5 rating, extremely difficult.

Irish students take Irish for 13 years, and some take French for five years. These students typically know French better than Irish. There are inflections for the inflections of the inflections, a convoluted aspiration system, and no words for yes or no. The system of initial consonant mutation is quite baffling. Noun declension is mystifying. Irish has irregular nouns, but there are not many of them:

the womanan bhean
the women
na mná

and there are only about 10 irregular verbs. There are dozens of different declension types for verbs. The various phonological gradations, lenitions and eclipses are not particularly regular. There are “slender” and “broad” variants of many of the consonants, and it is hard to tell the difference between them when you hear them. Many learners find the slender/broad consonants the hardest part of Irish. The orthography makes many lists of worst orthographies on Earth.

Irish gets a 4.5 ratings, very difficult.

Both Scots Gaelic and Irish Gaelic are written with non-phonetic spelling that is even more convoluted and irrational than English. This archaic spelling is in drastic need of revision, and it makes learners not want to learn the language. For instance, in Scots Gaelic, the word for taxi is tacsaidh, although the word is pronounced the same as the English word. There are simply too many unnecessary letters for too few sounds. Of the two, Scots Gaelic is harder due to many silent consonants.

Irish actually has rules for its convoluted spelling, and once you figure out the rules, it is fairly straightforward, as it is quite regular and it is actually rational in its own way. In addition, Irish recently underwent a spelling reform. The Irish spelling system does make sense in an odd way, as it marks things such as palatalization and velarization.

Scottish Gaelic and Manx have gone a long time with no spelling reforms.

Scottish Gaelic gets a 4.5 ratings, very difficult.

Manx is probably the worst Gaelic language of all in terms of its spelling since it has Gaelic spelling yet uses an orthography based on English which results in a crazy mix that makes many lists of worst scripts.

Manx gets a 4.5 rating, very difficult.

Common Byrthonic

Welsh is also very hard to learn, although Welsh has no case compared to Irish’s two cases. And Welsh has a mere five irregular verbs. The Byrthonic languages like Welsh and Breton are easier to learn than Gaelic languages like Irish and Scots Gaelic. One reason is because Welsh is written with a logical, phonetic alphabet. Welsh is also simpler grammar-wise, but things like initial consonant mutations can still seem pretty confusing and are difficult for the non-Celtic speaker to master and understand. Verbal declension is irregular.

caraf   I love
carwn   we love

cerais  I loved
carasom we loved

The problem above is that one cannot find any morpheme that means 1st person, 3rd person, or past tense in the examples. Even car- itself can change, and in connected speech often surfaces as gar-/ger-. And carwn can mean I was loving (imperfect) in addition to we love. There are no rules here, and you simply have to memorize the different forms.

Welsh gets a 4 rating, very hard to learn.

Breton is about in the same ballpark as Welsh. It has a flexible grammar, a logical orthography and only four irregular verbs.

On the other hand, there are very few language learning materials, and most of those available are only written in French.

Breton gets a 4 rating, very hard to learn.


Greek is a difficult language to learn, and it’s rated the second hardest language to learn by language professors. It’s easy to learn to speak simply, but it’s quite hard to get it down like a native. It’s the rare second language learner who attains native competence. Like English, the spelling doesn’t seem to make sense, and you have to memorize many words. Further, there is the unusual alphabet. However, the orthography is quite rational, about as good as that of Spanish. Whether or not Greek is an irregular language is controversial. It has that reputation, but some say it is not as irregular as it seems.

Greek has four cases: nominative, accusative, genitive and vocative (used when addressing someone). There are three genders: masculine, feminine and neuter. Nouns have several different declension patterns determined by the ending on the noun. Verb conjugations are about as complicated as in Romance. Greek does retain the odd aorist tense. In addition, it has the odd middle voice and optative mood. Greek syntax is quite complicated.

Greek gets a 5 rating, extremely difficult to learn.

Classic or Ancient Greek was worse, with a distinction between aspirated and unaspirated consonants, a pitch accent system and a truly convoluted, insanely irregular system of noun and verb inflection. It had a dual number in addition to singular and plural and a very difficult optative case. Irregular verbs had one of six different stem types. The grammar was one of the most complex of all languages, and the phonology and morphology were truly convoluted.

Ancient Greek is said to have had four different genitive cases, but it actually had four different uses of the genitive:

  1. Objective Genitive – “for obedience to faith”
  2. Subjective Genitive – “faith’s obedience” or faithful obedience
  3. Attributive Genitive – “obedience of faith”
  4. Genitive of Apposition – obedience, i.e. faith

How confusing!

Classic Greek gets a 5.5 rating, nearly hardest of all to learn.


An  obscure branch of Indo-European, Armenian, is very hard to learn. Armenian is a difficult language in terms of grammar and phonetics, not to mention the very odd alphabet. The orthography is very regular, however there are some irregularities. For instance:

գրել , written grel but spoken gərel (schwa removed in orthography)
խոսել, written xosel but spoken xosal  (a changed to e in orthography)

However, the alphabet itself presents many problems. Print and cursive can be very different, and upper case and lower case can also be quite different. Here are some pairs of letters in upper and lower case:

Ա ա
Յ յ
Փ փ

All in all, this means you have to memorize as many as four different shapes for each letter. However, the grammar is very regular.

In addition, many letters very closely resemble other letters, which makes it very easy to get them mixed up:

գ and զ
and է
and ղ
and ռ

There are voiced consonants and an alternation between aspirated and unaspirated unvoiced consonants, so some mix up the forms for b, p and , for instance. Nevertheless, there are many things about the grammar that seem odd compared to other IE languages. For instance, Armenian has agglutination, and that is a very strange feature for an IE language.

Part of the problem is that due to its location in the Caucasus, Armenian has absorbed influences from some of the wild nearly Caucasian languages. For instance, an extinct NE Caucasian Nakh language called Tsov is thought to have contributed to the Hurro-Ururtian substratum in Armenian. So in a sense when you learn Armenian, you are also learning a bit of Chechen at the same time. For some reason, Armenian scored very high on a weirdest languages survey.

People who have learned both Arabic and Armenian felt that Armenian was much easier, so Armenian seems to be much easier than Arabic.

Armenian is rated 4, very hard to learn.


Albanian is another obscure branch of Indo-European. Albanian nouns have two genders (masculine and feminine), five cases including the ablative, lost in all other IE. Both definite and indefinite articles are widely used, a plus for English speakers. Most inflections were lost, and whatever is left doesn’t even look very IE. The verbal system is complex, having eight tenses including two aorists and two futures, and several moods, including indicative, imperative, subjunctive, conjunctive, optative and admirative. The last three are odd cases for IE. The optative only exists in IE in Ancient Greek, Sanskrit and Manx. Oddly enough, there is no infinitive. Active and passive voices are used.

Similarly to Gaelic, Albanian is even harder to learn than either German or Russian. Albanian may be even harder to learn than Polish.

Albanian is rated 5,extremely difficult.


All Slavic languages have certain difficulties. For instance, the problematic perfect/imperfect tenses discussed below in Czech and Slovak are present in all of Slavic. The animate/inanimate noun class distinction is present in all of Slavic also. Slavic languages also add verb prefixes to verbs, completely changing the meaning of the verb and creating a new verb (see Italian above).

East Slavic

People are divided on the difficulty of Russian, but language teachers say it’s one of the hardest to learn. Even after a couple of years of study, some learners find it hard to speak even a simple sentence correctly.

It has six basic cases – nominative, accusative, dative, genitive, instrumental and prepositional – and analyses have suggested up to 10 other cases. The most common of the extra cases are locative, partitive and several forms of vocative. All of these extra cases either do not apply to all nouns (“incomplete” cases) or seem to be identical to an existing case. At any rate, the vocative is only used in archaic prose. And there is also a locative case, which is what the exceptions to the prepositional case are referred to. Russian has two genitive cases, the so-called Genitive 1 and Genitive 2. The first one is standard genitive and the second is the genitive-partitive (see above), which is now only used in archaic prose.

The grammar is fairly easy for a Slavic language. The problem comes with the variability in pronunciation. The adjectives and endings can be difficult. In addition, Russian has gender and lots of declensions. Like Lithuanian, almost everything in the language seems to decline. The adjectives change form if the nouns they describe have different endings. Adjectives also take case somehow.

Verbs have different forms depending on the pronouns that precede them. Russian has the same issues with perfective and imperfective forms as Polish does (see the Polish section below). There are dozens of different declension types for verbs and many verbs that are irregular and don’t fit into any of the declension types. In addition, there are many irregular nouns, syncretisms, and an aspectual system that is morphologically unpredictable.

Word order is pretty free. For instance, you can say:

I love you by saying

I love you.
You love I.
Love you I.
I you love.
Love I you.
You I love

Pronunciation is strange, with one vowel that is between an ü and i. Many consonants are odd, and every consonant has a palatalized counterpart, which will be difficult to speakers whose languages lack phonemic palatalized consonants. These are the soft and hard consonants that people talk about in Russian. The bl sound is probably the hardest to make, but the trilled r is also problematic.

Russian has several words that, bizarrely, are made up of only a single consonant:

s with, off of
to, towards
in, into
– subjunctive/conditional mood particle (would)
Z – emphatic particle

In addition, Russian has some very strange words that begin with a doubled consonant sound:


The orthography system is irregular, so there are quite a few silent letters and words that are pronounced differently than they are spelled.

Word Silent Letters Example
здн  [знпраздник
рдц  [рцсердце
лнц  [нцсолнце
стн  [снлестница
вств [ств]          чувство
жч   [щ]            мужчина
зч   [щ]            извозчик
сч   [щ]            счастье
чт   [штчто
чн   [шнконечно
тц   [ц]            вкратце
дц   [ц]            двадцать
тч   [ч]            лётчик
дч   [ч]            докладчик
тся  [цца]          учится
ться [цца]          учиться

Stress is quite difficult in Russian since it seems arbitrary and does not appear to follow obvious rules:

дóмаat home

One problem is that phonemic stress, not written out, changes the way the vowel is pronounced. For instance:

узнаюI’m finding out
I will find out

The two are written identically, so how you tell them apart in written Russian, I have no idea. However in speech you can tell one from the other because the two forms have different stress.

Russian also has vowel reduction that is not represented in the orthography. The combination of stress and vowel reduction means that even looking at a Russian word, you are not quite sure how to pronounce it.

Like German, Russian builds morphemes into larger words. Again like German, this is worse than it sounds since the rules are not so obvious. In addition, there is the strange Cyrillic alphabet, which is nevertheless easier than the Arabic or Chinese ones. Russian also uses prepositions to combine with verbs to form the nightmare of phrasal verbs, but whereas English puts the preposition after the verb, Russian puts it in front of the verb.

All of Slavic has a distinction between animate and inanimate nouns as a sort of a noun class. Russian takes it further and even has a distinction between animate and inanimate pronouns in the male gender:

dvoje muzhchin     two men
troje muzhchin     three men
chetvero muzhchin  four men
pyatero muzhchin   five men
shestero muzhchin  six men
semero muzhchin    seven men

Compare to:

dva duba      two oaks 
tri duba      three oaks 
chetyre duba  four oaks

However, Russian only has the animate/inanimate distinction in pronouns and not in nouns in general.

Like Polish below, you use different verbs depending if you are going somewhere on foot or other than on foot. Second there is a distinction between going somewhere with a goal in mind and going somewhere with no particular goal in mind. For instance, to go:

idti (by foot, specific endpoint)
xodit’ (by foot, no specific endpoint)
exat’ (by conveyance, specific endpoint)
ezdit’ (by conveyance, no specific endpoint)

The verb to carry also has four different forms with the same distinctions as above.

In addition, there are various prefixes you can put on a verb:

into                  v-
out of                vy-
towards               po-
away from             u-
up to the edge of     pod-
away from the edge of ot-
through               pro-
around                ob-

These prefixes look something like “verbal case.” You an add any of those prefixes to any of the going or carrying verbs above. Therefore, you can have:

poiti  –walk up to something
drive around with no goal
–  walk away from something with no goal in mind

The combination of paths and goals results in some very specific motion verbs.

Russian is harder to learn than English. We know this because Russian children take longer to learn their language than English speaking children do. The reason given was that Russian words tended to be longer, but there may be other reasons.

Russian has the advantage of having quite a bit of Romance and Greek loans for a Slavic language, but unfortunately, you will not typically hear these words in casual conversion. Russian also has no articles. English speakers will find this odd, but others regard it as a plus.

Russian is less difficult than Czech, Polish or Serbo-Croatian.

Russian gets a 4 rating, very hard to learn.

West Slavic
Czech and Slovak

Czech and Slovak are notoriously hard to learn; in fact, all Slavic languages are. Language professors rate the Slavic languages the third hardest to learn on Earth. Czech is in the Guinness Book of World Records as the hardest language to learn. Even the vast majority of Czechs never learn to speak their language correctly. They spend nine years in school studying Czech grammar, but some rules are learned only at university. Immigrants never seem to learn Czech well, however, there are a few foreigners who have learned Czech very well – say, three or fewer errors in a 30 minute monologue, so it is possible to learn Czech well even if it is not very common.

Writing Czech properly is even more difficult than speaking it correctly, so few Czechs write without errors. In fact, an astounding 1/3 of the population makes at least on grammatical or spelling mistake in every sentence they write! The younger generation is now even worse as far as this goes, as Czech language teaching for natives has become more lax in recent years and drills have become fewer. Nevertheless, the Czech and Slovak orthographies are very rational. There is nearly a 1-1 sound/symbol correspondence.

Even natives often mess up the conditional (would). The 3rd conditional (past conditional) has nearly gone out of modern Czech and has merged with the present conditional:

3rd conditional – If I “would have known” it, I would not have asked has merged with
2nd conditional – If I “would know” it, I would not ask.

This means conditional events in the present are no longer distinguished between those in the past, and the language is impoverished.

Native speakers also mix up a specific use of the gerund:


She looked at me smiling.
He walked along whistling.
He was in his bed reading a book.

This is easy to say in English, and the use of these forms is rather common. However, it is very hard to make those sentences in Czech, and possibly only 3% of the population can formulate those sentences properly. Instead, they break them up into two sentences:


She looked at me, and she smiled.
He was in his bed, and he was reading.

Czech is full of exceptions and exceptions to the exceptions. It is said that there are more exceptions than there are rules. Czech has seven cases in singular and seven more cases in plural for nouns, for a total of 59 different “modes” of declension. There are also words that swing back and forth between “modes.” Adjectives and pronouns also have seven cases in the singular and plural. Czech is one of the few languages that actually has two genitive cases – one more or less possessive and the other more or less partitive. There are six genders, three in the singular and three in the plural.

When you put all that together, each noun can decline in 59 different ways. Further, these 59 different types of nouns each have 14 different forms depending on case. Verbs also decline. The verbs have both perfective and imperfective and have 45 different conjugation patterns. Czech learners often confuse the perfect and imperfect verbs. Verbs of motion can also be quite tricky.

One of the problems with Czech is that not only nouns but also verbs take gender, but they only do so in the past tense. In addition, Czech has a complicated aspect system that is often quite irregular and simply must be memorized to be learned.

This conjugation is fairly regular:

viděl continuous past – he saw
punctual – once he suddenly saw
repetitive – he used to see (somebody/something) repeatedly

Others are less regular:

jedl continuous – he ate
snědl dojedl
he ate it all up
he ate a bit of it
he finished eating
repetitive – he used to eat repeatedly

Czech also has an evidential system. The particle prý is used to refer to hearsay evidence that you did not personally witness.

Prý je tam zima.
Someone said/People say it’s cold outside.

Truth is that almost every word in the language is subject to declension. The suffixes on nouns and verbs change all the time in strange ways.

There are some difficult consonants such as š, č, ť, ž, ľ, ď, dz, , ĺ and ŕ. It’s full of words that don’t seem to have vowels.

Entire Czech sentences can have extreme consonant clusters that appear to lack vowels:

Strč prst skrz krk.
Stick a finger through your neck.

Smrž pln skvrn zvlhl z mlh.
A morel full of spots welted from fogs…

Mlž pln skvrn zvh.

However, the letters r and l are considered “half-vowels” in Czech, so the sentences above are easier to pronounce than you might think.

The letters ř and r (Czech has contrasting alveolar trills) are hard to pronounce, and ř is often said to exist in no longer language, including other Slavic languages. It is only found in one other language on Earth –  the Papuan language Kobon, which pronounces it a bit differently. Even Czechs have a hard time making these sounds properly (especially the ř), and many L2 speakers never get them right. There is also a hard and soft i which is hard to figure out.

As with other Slavic languages like Russian, it has the added problem of fairly loose word order. In addition, there are significant differences between casual and formal speech where you use different forms for someone you are familiar with (are on a first name basis with) as opposed to someone you do not know well. In addition, females use different endings for the past tense than men do.

On the plus side, Czech stress, like that of Polish, is regular as the accent is always on the first syllable. But if you come from a language such as Spanish where the accent is typically on the second syllable, this might present an obstacle.

Czech gets a 5.5 rating, nearly hardest of all.

Slovak is closely related to Czech, and it is controversial which one is harder to learn. Slovak is definitely more archaic than Czech. Some say that Slovak is easier because it has a more regular grammar. Slovak has the additional problem is marking acute accents: á, é, í, ĺ, ó, ŕ, ú and ý. Slovak fortunately lacks the impossible Czech ř sound. Instead it has something called a “long r,” (ŕ) which is not very easy to make either. This is something like the er sound in English her.

Slovak, like Czech, has retained the vocative, but it almost extinct as it is restricted to only a few nouns. Like Polish and Sorbian, Slovak also has an animate/inanimate distinction in gender for plural nouns. So Slovak has five genders: masculine, feminine and neuter in the singular and animate and inanimate in the plural.

Some say that Slovak is even harder than Polish, and there may be a good case that Czech and Slovak are harder than Polish.

Slovak gets a 5.5 rating, nearly hardest of all.


Polish is similar to Czech and Slovak in having words that seem to have no vowels, but in Polish at least there are invisible vowels. That’s not so obviously the case with Czech. Nevertheless, try these sentences:

  1. Wszczebrzeszynie chrząszcz brzmi w trzcinie i Szczebrzeszyn z tego słynie.
  2. Wyindywidualizowaliśmy się z rozentuzjazmowanego tłumu.
  3. W Szczebrzeszynie chrząszcz brzmi w trzcinie.

I and y, s and z, je and ě alternate at the ends of some words, but the rules governing when to do this, if they exist, don’t seem sensible. The letter ť is very hard to pronounce. There are nasal vowels as in Portuguese. The ą, ć, ę, ł, ń, ó, sz, cz, dz, , sounds are hard for foreigners to make. There are sounds that it is even hard for native speakers to make as they require a lot tongue movements. A word such as szczescie is hard to Polish L2 speakers to pronounce. Polish written to spoken pronunciation makes little sense, as in English – h and ch are one sound – h, ó and u are the same sound, and u may form diphthongs where it sounds like ł, so u and ł can be the same sound in some cases.

The confusing distinction between h/ch has gone of most spoken Polish. Furthermore, there is a language committee, but like the French one, it is more concerned with preserving the history or the etymology of the word and less with spelling the word phonemically. Language committees don’t always do their jobs!

Polish orthography, while being regular, is very complex. Polish uses a Latin alphabet unlike most other Slavic languages which use a Cyrillic alphabet. The letters are: A Ą B C Ć D E Ę F G H I J K L Ł M N Ń O Ó Q P R S T U V W X  Y Z Ź Ż. Even Poles say that their orthography is very complicated.

Polish is even complex in terms of pronunciation. There are apparently rules for regarding comma use, but the rules are so complex that even native speakers can’t make sense of them.

Further, native speakers speak so fast it’s hard for non-natives to understand them. Due to the consonant-ridden nature of Polish, it is harder to pronounce than most Asian languages. Listening comprehension is made difficult by all of the sh and ch like sounds. Furthermore, since few foreigners learn Polish, Poles are not used to hearing their language mangled by second-language learners. Therefore, foreigners’ Polish will seldom be understood.

Polish grammar is said to be more difficult than Russian grammar. Polish has the following:

There are five different tenses: zaprzeszły, przeszły, teraźniejszy, przyszły prosty, and przyszły złozony.

There are seven different genders: masculine animate, masculine inanimate, feminine, and neuter in the singular and animate and inanimate in the plural. However, masculine animate and masculine inanimate and the plural genders are only distinguished in accusative. Masculine animate, masculine inanimate and neuter genders have similar declensions; only feminine gender differs significantly.

Masculine nouns have five patterns of declension, and feminine and neuter nouns have six different patterns of declension. Adjectives have two different declension patterns. Numbers have five different declension patterns: główne, porządkowe, zbiorowe, nieokreślone, and ułamkowe. There is a special pattern for nouns that are only plural.

There are seven different cases: nominative, genitive, dative, accusative, instrumental, locative, vocative. Only the genitive locative cases are irregular, the latter only in the singular. Verbs have nine different persons in their declensions: ja, ty, on, ona, ono, my, wy, oni, one. There are different conjugation patterns for men and women. There are 18 different conjugation patterns in the verb (11 main ones). There are five different polite forms: for a man, a woman, men, women and men and women combined.

There are four different participle forms, three of which inflect. Some of these are active and others are passive, but the whole system is incredibly complex. All of the participles decline like nouns, each gender adds its bit to each pattern which in turn change more according to tense.

Polish has seven cases, including the vocative which has gone out of most Slavic. The vocative is often said to be dying out, becoming less common or only used in formal situations, but the truth is that it is still commonly used.

In an informal situation, a Pole might be more like to use nominative rather than vocative:

Cześć Marek! (Nom.), rather than
Cześć Marku! (Voc.)

However, in a more formal situation, the vocative is still likely to be used:

Dzień dobry panie profesorze/doktorze! (Voc.). Dzień dobry pan profesor/doktor! (Nom.) would never be used, even in casual conversation.

Case declension is very irregular, unlike German. Polish consonant gradation is called oboczność (variation).

The genders of nouns cause the adjectives modifying them to inflect differently.

matka    mother (female gender)
ojciec   father (male gender)
dziecko  child (neuter gender)

Modifying Adjective
brzydkiugly ugly

brzydka matka     ugly mother
brzydki ojciec    ugly father
brzydkie dziecko  ugly child

brzydkie matki    ugly mothers
brzydcy ojcowie  ugly fathers
brzydkie dzieci   ugly children

Gender even effects verbs.

I ate (female speaker) Ja zjadłam
I ate (male speaker)   Ja zjadłem

There are two different forms of the verb kill depending on whether the 1st person singular and plural and 2nd person plural killers are males or females.

I killed     zabiłem/zabiłam
We killed    zabiliśmy/zabiłyśmy
They killed  zabili/zabiły

The perfective and imperfective tenses create a dense jungle of forms:

kupować - to buy

Singular  Simple Past         Imperfect
I (f.)    kupiłam             kupowałam
I (m.)    kupiłem             kupowałem
you (f.)  kupiłaś             kupowałaś
you (m.)  kupiłeś             kupowałeś
he        kupił               kupował
she       kupiła              kupowała
it        kupiło              kupowało

we (f.)   kupiłyśmy           kupowałyśmy
we (m.)   kupiliśmy           kupowaliśmy
you (f.)  kupiłyście          kupowałyście 
you (m.)  kupiliście          kupowaliście
they (f.) kupiły              kupowały
they (m.) kupili              kupowali

The verb above forms an incredible 28 different forms in the perfect and imperfect past tense alone.

The existence of the perfective and imperfective verbs themselves is the least of the problem. The problem is that each verb – perfective or imperfective – is in effect a separate verb altogether, instead of just being conjugated differently.

The verb to see has two completely different verbs in Polish:


WidziałemI saw (repeatedly in the past, like I saw the sun come up every morning).
ZobaczyłemI saw (only once; I saw the sun come up yesterday).

Some of these verbs are obviously related to each other:


But others are very different:


This is not a tense difference – the very verbs themselves are different! So for every verb in the language, you effectively have to learn two different verbs. The irregular forms may date from archaic Polish.

In addition, the future perfect and future imperfect often conjugate completely differently, though the past forms usually conjugate in the same way – note the -em endings above. There is no present perfect as in English, since in Polish the action must be completed, and you can’t be doing something at this precise moment and at the same time have just finished doing it. 95% of verbs have these maddening dual forms, but for 5% of verbs that lack a perfective version, you only have one form.

It’s often said that one of the advantages of Polish is that there are only three tenses, but this is not really case, as there are at least eight tenses:

Indicative         grac       to play
Present            gram       I play 
Past               gralem     I played
Conditional        gralbym    I would play
Future             będę grać  I will play
Continuous future  będę grał  I will be playing
Perfective future  bogram     I will have played*
Perf. conditional  pogralbym  I would have played

*Implies you will finish the action

There is also an aspectual distinction made when referring to the past. Different forms are used based on whether or not the action has been completed.

Whereas in English we use one word for go no matter what mode of transportation we are using to get from one place to another, in Polish, you use different verbs if you are going by foot, by car, by plane, by boat or by other means of transportation.

In addition, there is an animate-inanimate distinction in gender. Look at the following nouns:

hat      kapelusz
computer komputer
dog      pies
student  uczen

All are masculine gender, but computer and hat are inanimate, and student and dog are animate, so they inflect differently.

I see a new hatWidze nowy kapelusz
I see a new student
Widze nowego ucznia

Notice how the now- form changed.

In addition to completely irregular verbs, there are also irregular nouns in Polish:

człowiek -> ludzie

Let us look at pronouns. English has one word for the genitive case of the 1st person singular – my. In Polish, depending on the context, you can have the following 11 forms, and actually there are even more than 11:


Numerals can be complex. English has one word for the number 2 – two. Polish has 21 words for two, and  all of them are in common use.

dwa (nominative non-masculine personal male and neuter and non-masculine personal accusative)
dwaj (masculine personal nominative)
dwie (nominative and accusative female)
dwóch (genitive, locative and masculine personal accusative)
dwom (dative)
dwóm (dative)
dwu (alternative version sometimes used for instrumental, genitive, locative and dative)
dwoma (masculine instrumental)
dwiema (female instrumental)
dwoje (collective, nominative + accusative)
dwojga (collective, genitive)
dwojgu (collective, dative + locative)
dwójka (noun, nominative)
dwójkę (noun, accusative)
dwójki (noun, genitive)
dwójce (noun, dative and locative)
dwójką (noun, instrumental)
dwójko (vocative)
dwojgiem (collective, instrumental)

Polish also has the paucal form like Serbo-Croatian. It is the remains of the old dual. The paucal applies to impersonal masculine, feminine and neuter nouns but not to personal masculine nouns.

Personal Masculine

one boy     jeden chłopiec
two boys    dwóch chłopców
three boys  trzech chłopców
four boys   czterech chłopców
five boys   pięciu chłopców
six boys    sześciu chłopców
seven boys  siedmiu chłopców
eight boys  ośmiu chłopców

Impersonal Masculine

one dog     jeden pies
two dogs    dwa psy
three dogs  trzy psy
four dogs   cztery psy
five dogs   pięć psów
six dogs    sześć psów
seven dogs  siedem psów
eight dogs  osiem psów

In the above, two, three and four dogs is in the paucal (psy), while two, three or four men is not and is instead in the plural (chłopców)

A single noun can change in many ways and take many different forms. Compare przyjacielfriend

                             Singular         Plural
who is my friend             przyjaciel       przyjaciele
who is not my friend         przyjaciela      przyjaciół
friend who I give s.t. to    przyjacielowi    przyjaciołom
friend who I see             przyjaciela      przyjaciół
friend who I go with         z przyajcielem   z przyjaciółmi
friend who I dream of        o przyjacielu    o przyjaciołach
Oh my friend!                Przyajcielu!     Przyjaciele!

There are 12 different forms of the noun friend above.

Plurals change based on number. In English, the plural of telephone is telephones, whether you have two or 1,000 of them. In Polish, you use different words depending on how many telephones you have:

two, three or four telefony, but
five telefonów.

Sometimes, this radically changes the word, as in hands:

four ręce, but
five rąk.

There are also irregular diminutives such as

psiaczek  -> słoneczko

Polish seems like Lithuanian in the sense that almost every grammatical form seems to inflect in some way or other. Even conjunctions inflect in Polish.

In addition, like Serbo-Croatian, Polish can use multiple negation in a sentence. You can use up to five negatives in a perfectly grammatical sentence:

Nikt nikomu nigdy nic nie powiedział.
Nobody ever said anything to anyone

Like Russian, there are multiple different ways to say the same thing in Polish. However, the meaning changes subtly with these different word combinations, so you are not exactly saying the same thing with each change or word order. Nevertheless, this mess does not seem to be something that would be transparent to the Polish learner.

In English, you can say Ann has a cat, but you can’t mix the words up and mean the same thing. In Polish you can say Ann has a cat five different ways:

Ania ma kota.
Kota ma Ania.
Ma Ania kota.
Kota Ania ma.
Ma kota Ania.

The first one is the most common, but the other four can certainly be used. The truth that while the general meaning is the same in each sentence, the deep meaning changes with each sentence having a slightly different nuanced interpretation.

In addition, Polish has a wide variety of dialects, and a huge vocabulary. Although Polish grammar is said to be irregular, this is probably not true. It only gives the appearance of being irregular as there are so many different rules, but there is a method to the madness underneath it all. The rules themselves are so complex and numerous that it is hard to figure them all out.

Polish appears to be more difficult than Russian. For example, in Russian as in English, the 1st through 3rd person past tense forms are equivalent, whereas in Polish, they are each different:

          English   Russian     Polish

1st past  I went    ya pashou   ja poszedłem 
2nd past  you went  ty pashou   ty poszedłeś
3rd past  he went   on pashou   on poszedł

Even adult Poles make a lot of mistakes in speaking and writing Polish properly. However, most Poles are quite proud of their difficult language (though a few hate it) and even take pride in its difficult nature.

On the positive side, in Polish, the stress is fixed, there are no short or long vowels nor is there any vowel harmony, there are no tones and it uses a Latin alphabet.

Polish is one of the most difficult of the Slavic languages. Even Poles say it is very hard to learn. Most Poles do not learn to speak proper Polish until they are 16 years old! Although most Poles know how to speak proper Polish, they often use improper forms when speaking formally, not because they do not know how to speak correctly but simply because they feel like

It is harder than Russian and probably also harder than Czech, though this is controversial. There is a lot of controversy regarding which is harder, Czech or Polish.

Polish gets a 5 rating, extremely difficult.

South Slavic

It’s controversial whether Bulgarian is an easy or hard language to learn. The truth is that it may be the easiest Slavic language to learn, but all Slavic language  are hard. Though it is close to Russian, there are Russians who have been living there for 20 years and still can’t understand it well.

It has few cases compared to the rest of Slavic. There are three cases, but they are present only in pronouns. The only case in nouns is vocative. This is odd because most Slavic languages have either lost or are in the process of losing the vocative, and in Bulgarian it is the only case that has been retained. Compared to English, Bulgarian is well structured and straightforward with little irregularity. In addition, Bulgarian has more Romance (mostly French) and Greek borrowings than any other Slavic languages. Romance came in via the Vlahs who lived there before the Slavs moved in and Greek from the Byzantine period. In recent years, many English borrowings have also gone in.

Bulgarian has a suffixed general article that is not found in the rest of Slavic but is apparently an areal feature borrowed from Albanian. The stress rules are nightmarish, and it seems as if there are no rules.

Bulgarian has grammatical gender, with three genders – masculine, feminine and neuter. In addition, adjectives must agree with the gender of the noun they are modifying. In English, adjectives are invariable no matter what the noun is:

pretty man
pretty woman
pretty horse
pretty table

However, the Bulgarian alphabet is comparatively simple compared to other Slavic alphabets. Since 1945, it has only had 30 letters. Compare this to the 70 letters in Polish. There are only six vowels, and it has the easiest consonant clusters in Slavic. The orthography is very regular, with no odd spellings. The Cyrillic alphabet is different for those coming from a Latin alphabet and can present problems. For one thing, letters that look like English letters are pronounced in different ways:

В is pronounced v in Bulgarian
E is pronounced eh in Bulgarian
P is pronounced r in Bulgarian

There are a number of Bulgarian letters that look like nothing you have ever seen before: Ж, Я, Ь, Ю, Й, Щ, Ш, and Ч. Bulgarian handwriting varies to a great degree and the various styles are often difficult to map back onto the typewritten letters that they represent.

While Bulgarian has the advantage of lacking much case, Bulgarian verbs are quite complex even compared to other Slavic languages. Each Bulgarian verb can have up to 3,000 forms as it changes across person, number, voice, aspect, mood, tense and gender. Bulgarian has two aspects (perfect and imperfect), voice, nine tenses, five moods and six non infinitival verbal forms.

For instance, each verb has at two aspects – simple and continuous – for each of the tenses, which are formed in different ways. Onto this they add a variety of derivatives such as prefixes, suffixes, etc. that change the meaning in subtle ways:

Aorist or Perfect:

да прочитамto read in whole a single text/book/etc (viewed as fact, that is the duration of the action does not interest us)
да изчитам – to read every book there is on the subject (viewed as fact, that is the duration of the action does not interest us)
да дочетаto finish reading something (viewed as fact, that is the duration of the action does not interest us)

Continuous or Imperfect:

да четаto be reading (viewed as an action in progress)
да прочитамto read in whole a single text/book/etc (viewed as an action in progress)
да изчитамto read every book there is on the subject (viewed as an action in progress)

Mood is very complicated. There are different ways to say the same idea depending on how you know of the event. If you know about it historically, you mark the sentence with a particular mood. If you doubt the event, you mark with another mood.

If you know it historically but doubt it, you use yet another mood. And there are more than that. These forms were apparently borrowed from Turkish. These forms are rare in world languages. One is Yamana, a Patagonian language that has only one speaker left.

In Bulgarian, you always know if something is a noun, a verb or an adjective due to its marking. You will never have the same word as an adjective, noun and verb. In English, you can have words that act as verbs, adjectives and nouns.

Let’s dance!
Let’s go to the dance.
Let’s go to dance lessons.

Bulgarian is probably the easiest Slavic language to learn.

Bulgarian gets a 3.5 rating, above average difficulty.

Macedonian is very close to Bulgarian, and some say it is a dialect of Bulgarian. However, I believe that is a separate language closely related to Bulgarian. Macedonian is said the be the easiest Slavic language to learn, easier than Bulgarian. This is because it is easier to pronounce than Bulgarian. Like Bulgarian, Macedonian has lost most all of its case. But there are very few language learning materials for Macedonian.

Macedonian gets a 3.5 rating, above average difficulty.


Serbo-Croatian, similar to Czech, has seven cases in the singular and seven in the plural, plus there are several different declensions. The vocative is still going strong in Serbo-Croatian (S-C), as in Polish, Ukrainian and Bulgarian. There 15 different types of declensions: seven tenses, three genders, three genres or moods, and two aspects. Whereas English has one word for the number 2 – two, Serbo-Croatian has 17 words or forms.

Case abbreviations below:
N = NAV – nominative, accusative, vocative
G = Genitive
D = Dative
L =Locative
I = Instrumental

Masculine inanimate gender
N dva
G dvaju
D L I dvama

Feminine gender
N dve
G dveju
D L I dvema

Mixed gender
N dvoje
G dvoga
D L I dvoma

Masculine animate gender
N dvojica
G dvojice
D L dvojici
I dvojicom

N dvojka
G dvojke
D L dvojci
I dvojkom

The grammar is incredibly complex. There are imperfective and perfective verbs, but when you try to figure out how to build one from the other, it seems irregular. This is the hardest part of Serbo-Croatian grammar, and foreigners not familiar with other Slavic tongues usually never get it right.

Serbian has a strange form called the “paucal.” It is the remains of the old dual, and it also exists in Polish and Russian.  The paucal is a verbal number like singular, plural and dual. It is used with the numbers dva (2), tri (3), četiri (4) and oba/obadva (both) and also with any number that contains 2, 3 or 4 (22, 102, 1032).

gledalac            viewer
pažljiv(i)          careful
gledalac pažljiv(i) careful viewer

1 careful viewer  jedan pažljivi gledalac 
2 careful viewers dva pažljiva gledaoca   
3 careful viewers tri pažljiva gledaoca   
5 careful viewers pet pažljivih gledalaca

Above, pažljivi gledalac is singular, pažljivih gledalaca is plural and pažljiva gledaoca is paucal.

As in English, there are many different ways to say the same thing. Pronouns are so rarely used that some learners are surprised that they exist, since pronimalization is marked on the verb as person and number. Word order is almost free or at least seems arbitrary, similar to Russian.

Serbo-Croatian, like Lithuanian, has pitch accent – low-rising, low-falling, short-rising and short-falling. It’s not the same as tone, but it’s similar. In addition to the pitch accent differentiating words, you also have an accented syllable somewhere in the word, which as in English, is unmarked. And when the word conjugates or declines, the pitch accent can jump around in the word to another syllable and even changes its type in ways that do not seem transparent. It’s almost impossible for foreigners to get this pitch-accent right.

The “hard” ch sound is written č, while the “soft” ch sound is written ć. It has syllabic r and l. Long consonant clusters are permitted. See this sentence:

Na vrh brda vrba mrda.

However, in many of these consonant clusters, a schwa is present between consonants in speech, though it is not written out.

S-C, like Russian, has words that consist of only a single consonant:


Serbo-Croatian does benefit from a phonetic orthography.

It is said that few if any foreigners ever master Serbo-Croatian well. Similar to Czech and Polish, it is said that many native speakers make mistakes in S-C even after decades of speaking it, especially in pitch accent.

Serbo-Croatian is often considered to be one of the hardest languages on Earth to learn. It is harder than Russian but not as hard as Polish.

Serbo-Croatian gets a 4.5 rating, very difficult.

Slovenian or Slovene is also a very hard language to learn, probably on a par with Serbo-Croatian. It has three number distinctions, singular, dual and plural. It’s the only major IE European language that has retained the dual. Sorbian has also retained the dual, but it is a minor tongue. However, the dual may be going out in Slovenia. In Primorska it is not used at all, and in the rest of Slovenia, the feminine dual is not used in casual speech (plural is used instead), but the masculine dual is still used for masculine nouns and mixed pairs of masculine and feminine nouns.

In addition, there are six cases, as Slovene has lost the vocative. There are 18 different declensions of the word son, but five of them are identical, so there are really only 13 different forms.

   Singular Dual       Plural 
1. Sin      Sina       Sini
2. Sina     Sinov      Sinov
3. Sinu     Sinovoma   Sinovom
4. Sina     Sinova     Sinove
5. O sinu   O sinovoma O sinovih
6. S sinom  Z sinovoma Z sini

There are seven different ways that nouns decline depending on gender, but there are exceptions to all of the gender rules. The use of particles such as pa is largely idiomatic. In addition, there is a lack of language learning materials for Slovene.

Some sounds are problematic. Learners have a hard time with the č and ž sounds. There are also “open” and “closed” vowels as in Portuguese.

Here is an example of a word that can be difficult to pronounce:


However, Slovene has the past perfect that is the same as the English tense, lost in the rest of Slavic. In addition, via contact with German and Italian, many Germanic and Romance loans have gone in. If you know some German and have some knowledge of another Slavic language, Slovene is not overwhelmingly difficult.

Some people worry that Slovene might go extinct in the near future, as it is spoken by only 2 million people. However, even this small language has 356, 881 headwords in an online dictionary. So it is clear that Slovene has plenty enough vocabulary to deal with the modern world.

Slovene is easier than Russian, Serbo-Croatian, Polish, Czech or Slovak.

Slovenian gets a 4 rating, very hard.


Eastern Baltic

Lithuanian, an archaic Indo-European Baltic tongue, is extremely difficult to learn. There are many dialects, which is interesting for such a small country, and the grammar is very difficult, with many rules. There is grammatical gender for nouns, and in addition, even numerals have gender in all cases. The language is heavily inflectional such that you can almost speak without using prepositions.

A single verb has 16 participial forms, and that is just using masculine gender for the participles. You can also add feminine forms to that verb. There are two main genders or giminės, masculine and feminine, but there is also neutral gender (bevardė giminė), which has three different forms. Verbs further decline via number (singular, dual and plural) and six different cases. There are five classes of verbs and six modes of declension for nouns (linksniai). However, Lithuanian verb tense is quite regular. You only need to remember infinitive, 3rd person present and 3rd person past, and after that, all of the conjugations are regular.

Here is an example of the Lithuanian verb:

Eiti – “to go. Ei is the verb root, and ti is in infinitival suffix.

Verbs decline according to:

Person and number
1st singular einu   I go  
3rd dual     einava we two go
1st plural   einame we go

The four tenses

2nd pl. past       Ėjote    you (guys) went
2 sing. imperfect  eidavote you used to go
2 sing. indicative einate   you go
2 sing. future     eisite   you will go

They also change according to something called “participants.” The participant paradigm has three tenses and all three genders. Participants are further divided into direct and indirect.

Regular direct participant (3 tenses, 3 genders)

Ėjęs   while he himself went
einąs  while he himself is going
eisiąs while he himself will be going

Ėjusi  while she herself went

buvo einama while it itself went
einama      while it itself was going
bus einama  while it itself will be going

Regular indirect participant (3 tenses, 3 genders)

past    eidytas     one that was forced to go
present eidomas     one that is being forced to go
future  bus eidomas one that will be forced to go

Semi participant (no tenses, 2 genders)

eidamas while going himself

eidama  while going herself

Active participant (2 tenses, no genders)

past    Ėjus   while going (in the past)
present einant while going now

2nd infinitive or budinys (no tenses)

eite in a way of going

Plusquamperfect (be + regular participants)

indicative būti   to have been gone
present    yra    has been gone
past       buvo   had been gone 
imperfect  būdavo used to have been gone 
future     bus    will have been gone

past 3pl   buvo ėję they had been gone 

Additional moods 

Imperative (all persons) 

Eik!             Go! 
Eikime!          Let's go! 
Teeina/Lai eina! Let him/her go! 

Subjunctive (all persons) 
eičiau I would go 
eitum  thou would go

In addition, while most verb marking is done via suffixes, Lithuanian can make aspect via both suffixes and prefixes, bizarrely enough (Arkadiev 2011).

Determining whether a noun is masculine or feminine is easier than in German where you often have to memorize which noun takes which gender. Lithuanian is similar to Spanish in that the ending will often give you a hint about which gender the noun takes.

Here is an example of the sort of convolutions you have to go through to attach the adjective good to a noun.

geras - good

             Masculine          Feminine

             Singular  Plural   Singular  Plural
Nominative   geras     geri     gera      geros
Genitive     gero      gerų     geros     gerų
Dative       geram     geriems  gerai     geroms
Accusative   gerą      gerus    gerą      geras
Instrumental geru      gerais   gera      geromis
Locative     gerame    geruose  geroje    gerose

The noun system in general of Lithuanian is probably more complicated even than the complex Russian noun system. Lithuanian is possibly more irregular and may have more declensions than even Polish. Learners often feel that the grammar is illogical.

Furthermore, while it does not have lexical tone per se, it does have pitch accent – there are three different pitches or degrees (laipsniai), which sound like tones but are not tones. Stress is hardly predictable and nearly needs to be learned word by word. It’s almost impossible for foreigners to get the accent right, and the accents tend to move around a lot across words during declension/conjugation such that the rules are opaque if they exist at all. It was formerly thought to be nearly random, but it has now been found that Lithuanian stress actually falls into four paradigms, so there is a system there after all.

You cannot really forget about lexical tone when learning Lithuanian, as stress is as fundamental to Lithuanian as tone is to Mandarin.

Often you need a dictionary to figure out where the accent should be on a word. Lithuanian pronunciation is also difficult. For example, look at rimti (to get calm) and rimti (serious – plural, masculine, nominative). There is a short i sound that is the same in both words, but the only difference is where the stress or pitch accent goes. Consonants undergo some complicated changes due to palatalization. Lithuanian has soft and hard (palatalized and nonpalatalized) consonants as in Russian.

Try these words and phrases:

šąla šiandien
ačiū už skanią vakarienę
čežėti šiauduose

Or this paragraph:

Labas, kaip šiandien sekasi? Aš esu iš Lietuvos, kur gyvenu visą savo gyvenimą. Lietuvių kalba yra sunkiausia iš visų pasaulyje. Ačiū už dėmesį.

Lithuanian is an archaic IE language that has preserved a lot of forms that the others have lost.

In spite of all of that, picking up the basics of Lithuanian may be easier than it seems, and while foreigners usually never get the pitch-accent down, the actual rules are fairly sensible. Nevertheless, many learners never figure out these rules and to them, there seem to be no rules for pitch accent.

Learning Lithuanian is similar to learning Latin. If you’ve been able to learn Latin, Lithuanian should not be too hard. Also, Lithuanian is very phonetic; words are pronounced how they are spelled.

Some languages that are similar to English, like Norwegian and Dutch, can be learned to a certain extent simply by learning words and ignoring grammar. I know Spanish and have been able to learn a fair amount of Portuguese, French and Italian without learning a bit of grammar in any of them.

Lithuanian won’t work that way because due to case, base words change form all the time, so it will seem like you are always running into new words, when it fact it’s the same base word declining in various case forms. There’s no shortcut with Latin and Lithuanian. You need to learn the case grammar first, or little of it will make sense.

Some say that Lithuanian is even harder to learn than the hardest Slavic languages like Polish and Czech. It may be true.

Lithuanian gets a 5 rating, extremely hard to learn.

Latvian is another Baltic language that is somewhat similar to Lithuanian. It’s also hard to learn. Try this:

Sveiki, esmu no Latvijas, un mūsu valoda ir skanīga, skaista un ar ļoti sarežģītu gramatisko sistēmu.

Latvian and Lithuanian are definitely harder to learn than Russian. They both have aspects like in Russian but have more cases than Russian, plus a lot more irregular verbs. Latvian, like Lithuanian, has a tremendous amount of inflection. The long vowels can be hard to pronounce.

Latvian is easier to learn than Lithuanian. The grammar is easier to figure out and the phonological system is much easier. Also, Latvian has lost many archaic IE features that Lithuanian has retained. Latvian has regular stress, always on the first syllable, as opposed to Lithuanian’s truly insane stress system. Latvian has fewer noun declensions, and fewer difficult consonant clusters.

Latvian gets a 4.5 rating, very hard.


Arkadiev, Peter. 2011. On the Aspectual Uses of the Prefix Be- in Lithuanian.
Baltic Linguistics 2:37-78.
Seymour, Philip H. K.; Aro, Mikko; Erskine, Jane M. and the COST Action A8 Network. 2003. Foundation Literacy Acquisition in European Orthographies. British Journal of Psychology 94:143–174.

This research takes a lot of time, and I do not get paid anything for it. If you think this website is valuable to you, please consider a a contribution to support more of this valuable research.


Filed under Albanian, Applied, Armenian, Baltic, Bulgarian language, Celtic, Czech, Danish, Descriptive, Dutch, English language, French, Gaelic, German, Germanic, Greek, Hellenic, Hindi, Icelandic, Indic, Indo-European, Indo-Hittite, Irish Gaelic, Italian, Italic, Kashmiri, Language Families, Language Learning, Language Samples, Latvian, Linguistics, Lithuanian, Norwegian, Polish, Portuguese, Romance, Russian, Sanskrit, Serbo-Croatian, Sinhala, Slavic, Slovak, Spanish, Swedish

어린 소년의 히로인 주사

This is a Korean translation of the One Boy One Needle post. The translation is by 넝근넝근, who does fine work.

Some people have found this video pretty upsetting, so exercise caution in viewing. WordPress has not explicitly told me to take this down yet, so I will leave it up until they do, but I’m not sure if it’s ok with them.

이 비디오는 실제로는 몇년 전에 등장했습니다만 지금까지 유명했는지 아니면 아마 지금부터 유명해질 것인지는 모릅니다. 몇몇 이름들이 나오지만, 히로인중독 소년들을 제외하고는 누군지 기억이 나지 않습니다.

이 비디오의 뒷이야기는 매우 이상합니다. 그리고 누구도 이 비디오가 가짜인지 진짜인지를 모릅니다.

뒷 이야기인즉슨 마약의 위험성을 알리기위한 한 러시아인이 촬영된 비디오를 편집한 영상이라는 것입니다. 러시아의 어린 소년들이 히로인을 주사하고 있는 장면을 포함해서요. 촬영된 소년은 8살 입니다. 처음 비디오를 촬영한 사람은 실제론 마약상이었습니다. 이 편집된 영상은 아마 8살 소년이 히로인을 맞다가 결국 그것때문에 죽어갈것이라고 묘사하는것 같습니다. 그는 이 비디오로 마약재활원 건립 자금을 받는다며 비디오를 서방기구에 공개하였지만 그는 돈을 받고는 재활원을 건립하지 않았습니다.

그는 나중에 경찰에 체포되었습니다. 경찰은 저 소년에 대한 살인 미수를 적용했습니다. 그는 수감되었고 러시아의 감옥에서 죽었습니다. 교도관들이 그의 죽음의 관해 말하기를 “그는 끊임없이 벽에 머리를 박다가 죽었습니다”라고 합니다 그렇겠죠 아마도 교도관들이나 다른 수감자들에게 맞거나 아니면 둘다에게 맞아 죽은듯이 보입니다.

어떻게 이런걸 아냐고요? 왜냐하면 다른 버전에는 뒤에 러시아 말로 설명이 있거든요(저는 설명이 없어 보기 좋은 버전을 가지고 있습니다.) 그 러시아어 설명이 제가 위에서 말했던 것입니다.

어 쨋든, 아무도 그 이야기가 진짜 인지는 모릅니다. 저 죽을것처럼 보이는 8살 소년의 이름도 모릅니다. 몇몇은 저 소년이 죽지 않았으며 저 죽어가는 것 처럼 보이는 건 단지 “헤로인을 주사하고 나서 보이는 정상적인 반응” 이라고 합니다. 우리는 저 마약상의 이름도 모르고 그가 체포됬는지 감옥에서 죽었는지도 모릅니다. 간단히 말해 이 뒷이야기는 아직까지 사실이라고 결론이 나지 않았습니다.

몇몇은 이 비디오가 90년대 러시아에 헤로인이 급속도로 퍼질때 등장했다고 보고 심지어 어린애들또 주사했다고 합니다. 그 후로는 잠잠해졌습니다.

어 쟀든, 가짠든 진짜는, 이 비디오는 8살소년이 헤로인을 주사하며 죽어가는(진짜 일수도 있고)장면에 놀랐습니.다. 이건 인터넷상의 아주 저속한 비디오 들중 하나입니다. 폭력적인게 없어서 전 별로 큰 충격이 없군요. 저는 스스로 저 소년이 죽는게 아니라 잠에 들며 다른 문제는 없을 거라고 생각합니다. 물론 어린애가 마약주사하는게 좋지많은 않군요.

불행하게도, 제가 찾은 다른 복사본은 처음 시작할때 King of Cordia’s Den 웹사이트의 한 멍청이가 써놓은 온갖 난잡한 설명으로 뒤덮여있더군.

1 Comment

Filed under Crime, Dope, Eurasia, Heroin, Intoxicants, Korean, Morbid, Narcotics, Overdoses, Regional, Russia, Sick, Sick and Evil, Translations

어마어마한 여드름 짜내기

This post has been translated into Italian as Un Bisturi Una Cisti (traduzione in italiano) and into French as Un Bistouri Un Kyste (en Français).

This is a Korean translation of the One Lance, One Cyst post. Most people should be able to handle watching this post, though it is rather gross. It was translated by 넝근넝근, who does incredible work.

나는 단지 의학적 지식을 궁구히 하는 목적으로 올렸을 뿐입니다이 비디오는 정말 역겹습니다.

전 이걸 단지 의학 비디오로 보고있습니다. 아마도 진료실에서 촬영한것 같군요. 보기엔 더러운 낭종 (혹은 염증, 여드름 아니면 다른 어떤걸)을 채혈하고 있습니다. 외관상으로는 피지낭으로 보입니다.

전 중년이지만 여드름 터뜨리는걸 좋아합니다. 흉터를 남긴다는 건 알지만 솔직하게 말하면 특히 당신이 손을 깨끗이 씻고 샤워를 자주 한다면 여드름은 없어질 것이고 문제가 되지 않을겁니다.

여기에 나오는 건 역사상 가장 무시무시한 여드름 같군요. 계속 짜내자 서서히 줄어들어 끝날 것 처럼 보입니다. 의사가 막 끝낼거라고 생각하는 순간 이 여드름은 또 다시 우글우글 쏟아집니다.

당신이 내가 이 영상을 올린데에 불만스럽다면.


Filed under Gross, Health, Korean, Medical Procedures, Medicine, Operations, Sick, Sick and Evil, Translations