• More GPUs =/= better AI.

    More data =/= better AI.

    More tech bro “superstars” =/= better AI

    This is what people like Musk and Zuckerberg don’t seem to understand.

    Training scales very poorly past a certain cluster size, especially if you go for new architectures to actually pursue improvements, hence reports of GPUs being tasked with busywork just to meet utilization quotes. Increasing data size and training scale hits diminishing returns, quick, or even regresses models because the bulk data is shit and the model is too inefficient. A prime example: Llama 4. “Superstar” AI engineers are better and Tweeting and sycophantic gaslighting than coding something interesting.

    In other words, I’d argue there’s a much smaller “sweet spot” for pure LLMs that these billionaires are way, way past. And no one is telling them no because they’re too rich to hear it. It’s all going to collapse on itself because scaling like that just does not work.

    • This is what people like Musk and Zuckerberg don’t seem to understand.

      They know, but can’t admit it. Pretend and keep the stock price going up makes them money, so for them it’s still working. Meanwhile they hope for a breakthrough, bail-outs or a new hype train to jump on.

      • No, they don’t.

        They’re surrounded by yes men. And from everything I hear them say, they don’t understand the first thing about how LLMs actually work.

  • 2 hours

    Ok but why is the thumbnail a picture of Garrison Keillor?

  • Great, but why would I respect the opinion of someone who works at Meta?

    • 3 hours

      You shouldn’t respect anyone’s opinion: you hear the opinion and judge the contents on it’s own merit. Everyone you respect will have the odd terrible take, and people you loathe will periodically have a banger thought.

      A ton of Dawkins fan boys are huffing copium since the man decided Claude is conscious (and for some incelly reason a woman). A lot of MAGA asshats are trying to torch data centers.

      If you offload your own critical evaluation of ideas merely to an adjudication of the speaker… honestly I hear it no differently than “because chatGPT said”.

      This is just the laziest ad homium. Tell me, what do you think about the content? I am genuinely curious, and that analysis is 1000x more valuable.

    • 4 hours

      3 reasons: “one of the "Godfathers of AI … who previously served as Meta’s chief AI scientist

      • Yeah but the association is enough for some people. He probably doesn’t even use Arch Linux.

  • Wow the only technology that is designed to lie to you is a failure? Pattern recognition means it can recognize wrong patterns.