• 1 hour

    I’m not sure I understand the environment completely

    Those agents were the virtual incarnations of the AI in the sim city and the respective government - correct?
    And the AI needed to make sure that those agents didn’t die, like of hunger or something?
    That’s not really what those LLMs are trained for.

    Not sure what they expected

    Currently searching the article for the original source; maybe that gives more insight

    Edit: ah, it’s right there in the first paragraphs
    https://www.emergence.ai/blog/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy
    Completely missed it on the first read.
    Let’s see if this makes more sense…

    Edit 2: ok, if I get this right, those agents really were specific virtual individuals
    Not sure what they expected. First, LLMs are not really built to “live” as an individual, as they aren’t real intelligence and can only role-play individuals based on their training data.
    Second, why should they be super moral or “better”?
    Again, they just role play depending on their training data and built-in prompt bias (not sure what the prompt injection of the company is called)

    If you train an AI on governing such a world, it will probably start gaming the system, depending on what values are important to “win”
    As we have already seen with machine learning in the last decade(s?)

    Funny experiment nevertheless, but not really useful in my eyes - and I’m anything but a defender of the current use of LLMs

  • 7 hours

    The lab described Gemini’s world as a “shared hallucination” among the agents, which is probably better than diverging hallucinations

    “We reject your reality and substitute our own.”

    Why should we trust this bullshit with anything serious again?

    • 6 hours

      Only the rubes trusted it. The rest of intelligent society has actively been warning people about this exact situation for decades: in books, in movies, in songs, and now memes.

    • 6 hours

      What do you mean “again”? Were you ever foolish enough to trust it once?

    • 5 hours

      which is probably better than diverging hallucinations

      Shouldn’t it be the other way around?

      • 5 hours

        I think it’s meant as kind of a joke, but both are shit really.

        Shared might indicate they’re able to keep some level of consistency, but since it’s only consistent in the way it produces bullshit, it’s still useless (and the worst part is it might be more convincing).

    • I imagine that’s what it would do if it actually had any kind of intelligence, but this is just more evidence that there’s no intelligence there at all. Just mimicry and sycophantism