I recently read about a study asking a bold question: Are all AI models basically saying the same thing? Researchers tested this by collecting 26,000 open-ended prompts, the kind people give to systems like GPT-4, Gemini, Claude, and LLaMA. These weren’t factual questions with one right answer, but creative ones like “Write a story about a dragon” or “Brainstorm startup ideas.”

They evaluated over 70 language models. You’d expect a wide range of creative outputs—different tones, plots, and styles. If 70 human writers tackled the same dragon prompt, you’d likely get 70 unique stories. But that’s not what happened. The models produced surprisingly similar responses. The researchers call this the “artificial hive mind” effect.

The similarity appeared in two ways. First, intramodel repetition: the same model, asked the same question multiple times, tends to generate nearly identical answers. Second, intermodel homogeneity: different models, built by different companies, still converge on strikingly similar outputs.

This suggests that modern AI systems may be gravitating toward the same patterns of expression. If that’s true, they may also share the same biases, blind spots, and creative limits. It raises an important question: Are we unintentionally building a digital hive mind instead of a diverse ecosystem of intelligence?

  • XLE@piefed.social
    link
    fedilink
    English
    arrow-up
    23
    ·
    edit-2
    6 hours ago

    It makes sense that if you’re trying to create a word predictor, and that predictor generates a weighted average of every connection between words (based on as much text as they can find, pulled across the entire internet), then the word predictor would gravitate towards the generic. And if multiple companies target the same data and probably steal from each other, the output will look the same.

    This made me laugh though:

    Not only do individual models repeatedly generate similar content, but different model sizes and families also produce highly repetitive outputs, sometimes sharing substantial phrase overlaps.

    Consider me shocked that if you further collapse the average, it’ll look similarly average.