Artificially intelligent, inherently racist

A new study regarding online language prediction models reveals that they discriminate against young, non-white men

Talk of artificial intelligence tends to fall into two camps: that of an interconnectedness that streamlines every aspect of human life – or a dystopian HAL 9000-type technological singularity in which “I’m sorry, Dave. I’m afraid I can’t do that,” is the last thing we hear before the machines take over and turn us into fleshy slaves.

Right now, we’re in the grey zone of prototypes, so some of our forays into AI are less than perfect. Take language prediction models. They’re used in everything from Google searches to legal cases, but a new study by researchers at China’s National University of Defense Technology and the University of Copenhagen shows they have a systemic racial bias.

Deeply ingrained tech
The language models under the microscope were ELECTRA, GPT-2, BERT, DistilBERT, ALBERT and RoBERTa. If you’re wondering why so many are called ‘Bert’, they’re all offshoots of the original ‘Bidirectional Encoder Representations from Transformers’ – a type of machine-learning technique developed by Google in 2018. 

To give an idea of how prevalent these models are: at the end of 2019, BERT had been adopted by Google’s search engine in 70 languages. By 2020 the model was used in almost every English-language search query. This is the technology that fills in the gap in the search bar when you type “Why am I so ___?”

The study in detail
The study measured the models’ performance differences across demographics in so-called English-language ‘cloze tests’ (fill-in-the-gap tests). Since the cloze task is how BERT systems are trained, researchers were able to evaluate the models directly.

Some 3,085 sentences were completed by 307 human subjects asked to fill in the most likely word based on their experience. They were sorted into 16 demographics according to age, gender, education and race. The ‘fairness’ of the language model responses was measured by whether the risk of error across any two demographics was roughly equal.

The results showed a systemic bias against young non-white male speakers. Older, white speakers were also poorly aligned. Not only do the models learn stereotypical associations, they also learn to speak more like some than like others – in this case white men under the age of 40.

Why is it important?
We already know that BERT is an integral part of our online navigation system, so users who do not align with the models receive unequal results and opportunities.

When GPT-2 was announced in February 2019 by San Francisco technology company OpenAI, James Vincent of The Verge described its writing as “one of the most exciting examples yet” of language generation programs.

“Give it a fake headline, and it’ll write the rest of the article, complete with fake quotations and statistics. Feed it the first line of a short story, and it’ll tell you what happens to your character next,” he said.

The Guardian called it “plausible newspaper prose”, while journalists at Vox mused that GPT-2 may be the technology that kicks them out of their jobs. A study by the University of Amsterdam even found that some participants were unable to distinguish poems generated by GPT-2 from those written by humans.

The upshot should be better training, argue the researchers at the University of Copenhagen, so the models more accurately represent the diversity of users.




  • Denmark to explore screening citizenship applicants for anti-democratic sentiments

    Denmark to explore screening citizenship applicants for anti-democratic sentiments

    A few weeks after Alex Vanopslagh’s comments about “right values,” the government announced that an expert committee would be established to examine the feasibility of screening citizenship applicants for anti-democratic attitudes.

  • The Future Copenhagen

    The Future Copenhagen

    The municipality plan encompasses building 40,000 houses by 2036 in order to help drive real estate prices down. But this is not the only huge project that will change the shape of the city: Lynetteholmen, M5 metro line, the Eastern Ring Road, and Jernbanebyen will transform Copenhagen into something different from what we know today

  • It’s not you: winter depression is affecting many people

    It’s not you: winter depression is affecting many people

    Many people in Denmark are facing hard times marked by sadness, anxiety, and apathy. It’s called winter depression, and it’s a widespread phenomenon during the cold months in Nordic countries.

  • Crime rates are rising, but people are safer

    Crime rates are rising, but people are safer

    Crime in Denmark is increasing for the second consecutive year, but it is more focused on property, while people appear to be safer than before. Over the past year, there were fewer incidents of violence

  • Taylor Swift and Martin Brygmann lead Google’s 2024 searches in Denmark

    Taylor Swift and Martin Brygmann lead Google’s 2024 searches in Denmark

    Google published the list of the top searched topics in Denmark during 2024. Taylor Swift is still on top, but domestic and foreign politics drew a lot of attention

  • Novo Nordisk invests 8.5 billion DKK in new Odense facility

    Novo Nordisk invests 8.5 billion DKK in new Odense facility

    Despite Novo’s announcement that its growth abroad will be larger than in Denmark, the company announced this morning an 8.5 billion DKK investment for a new facility in Odense. This is the first time the company has established a new production site in Denmark this century.