• @UnderpantsWeevil@lemmy.world
    link
    fedilink
    English
    51
    edit-2
    9 months ago

    Hm even with DeepSeek being more efficient, wouldn’t that just mean the rich corps throw the same amount of hardware at it to achieve a better result?

    Only up to the point where the AI models yield value (which is already heavily speculative). If nothing else, DeepSeek makes Altman’s plan for $1T in new data-centers look like overkill.

    The revelation that you can get 100x gains by optimizing your code rather than throwing endless compute at your model means the value of graphics cards goes down relative to the value of PhD-tier developers. Why burn through a hundred warehouses full of cards to do what a university mathematics department can deliver in half the time?

    • @AppleTea@lemmy.zip
      link
      fedilink
      English
      8
      edit-2
      9 months ago

      you can get 100x gains by optimizing your code rather than throwing endless compute at your model

      woah, that sounds dangerously close to saying this is all just developing computer software. Don’t you know we’re trying to build God???

      • @UnderpantsWeevil@lemmy.world
        link
        fedilink
        English
        29 months ago

        Altman insisting that once the model is good enough, it will program itself was the moment I wrote the whole thing off as a flop.