Overall, when tested on 40 prompts, DeepSeek was found to have a similar energy efficiency to the Meta model, but DeepSeek tended to generate much longer responses and therefore was found to use 87% more energy.
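
As a rough illustration of the arithmetic behind that figure: if energy per generated token is about the same for both models, total energy scales with response length. The sketch below assumes a simple linear model; the per-token cost and token counts are hypothetical placeholders chosen to reproduce the 87% figure, not measurements from the test.

```python
def total_energy(joules_per_token: float, tokens: int) -> float:
    """Energy for one response, assuming cost scales linearly with output length."""
    return joules_per_token * tokens


# Hypothetical values for illustration only (not measured data).
JOULES_PER_TOKEN = 1.0   # assumed identical for both models ("similar efficiency")
baseline_tokens = 1000   # assumed length of the shorter response
longer_tokens = 1870     # assumed length of the longer response

baseline = total_energy(JOULES_PER_TOKEN, baseline_tokens)
longer = total_energy(JOULES_PER_TOKEN, longer_tokens)

# Relative increase in energy caused purely by the longer output.
extra_percent = (longer / baseline - 1) * 100
print(f"{extra_percent:.0f}% more energy")
```

Under these assumptions, equal per-token efficiency and 87% more energy would correspond to responses roughly 1.87 times as long.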

  • Onno (VK6FLAB)
    122 months ago

    And here I thought that the energy consumption was in the training.

    • AatubeOP
      12 months ago

      The issue might be that the energy it saves in training is offset by its more intensive techniques for answering questions, and by the long answers it produces.