• Alex@lemmy.ml
    link
    fedilink
    arrow-up
    117
    arrow-down
    1
    ·
    2 days ago

    If you have ever read the “thought” process on some of the reasoning models you can catch them going into loops of circular reasoning just slowly burning tokens. I’m not even sure this isn’t by design.

    • dream_weasel@sh.itjust.works
      link
      fedilink
      arrow-up
      1
      ·
      3 hours ago

      This kind of stuff happens on any model you train from scratch even before training for multi step reasoning. It seems to happen more when there’s not enough data in the training set, but it’s not an intentional add. Output length is a whole deal.

      • MotoAsh@piefed.social
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        1
        ·
        1 day ago

        You have to pay for tokens on many of the “AI” tools that you do not run on your own computer.

        • Feathercrown@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          ·
          edit-2
          6 hours ago

          Hmm, interesting theory. However:

          1. We know this is an issue with language models, it happens all the time with weaker ones - so there is an alternative explanation.

          2. LLMs are running at a loss right now, the company would lose more money than they gain from you - so there is no motive.

          • MotoAsh@piefed.social
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            1
            ·
            5 hours ago

            Of course there’s a technical reason for it, but they have incentive to try and sell even a shitty product.

          • MotoAsh@piefed.social
            link
            fedilink
            English
            arrow-up
            2
            ·
            edit-2
            4 hours ago

            I think many of them do, but there are also many “AI” tools that will automatically add a ton of stuff to try and make it spit out more intelligent responses, or even re-prompt the tool multiple times to try and make sure it’s not handing back hallucinations.

            It really adds up in their attempt to make fancy autocomplete seem “intelligent”.

            • piccolo@sh.itjust.works
              link
              fedilink
              arrow-up
              1
              ·
              4 hours ago

              Yes, reasoning models… but i dont think they would charge on that… that would be insane, but AI executives are insane, so who the fuck knows.