• lennivelkant@discuss.tchncs.de
      1 month ago

      I think it’s a symptom of the age-old issue of missing QA: without solid QA, you have no figures on how often your human solutions get things wrong, how often your AI does, or how the two stack up.

  • skeesx@lemm.ee
    1 month ago

    Even this hand-picked example is wrong, as it returns true if num is 1.
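    For comparison, here is a minimal sketch of a check that handles that edge case. The generated function isn't reproduced in this thread, so the name `is_prime` and the trial-division approach are assumptions for illustration, not the code under discussion:

```python
import math


def is_prime(num: int) -> bool:
    """Trial-division primality check (illustrative sketch only)."""
    # 0, 1 and negative numbers are not prime by definition.
    if num < 2:
        return False
    # A composite number always has a divisor no larger than its square root.
    for divisor in range(2, math.isqrt(num) + 1):
        if num % divisor == 0:
            return False
    return True
```

    The `num < 2` guard is the part that matters here: without it, the loop body never runs for 1 and the function falls through to `True`.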

  • fossphi@lemm.ee
    1 month ago

    Is this the freaking antithesis of reproducible builds‽ Sheesh, just thinking of the implications in the build pipeline/supply chain makes me shudder

      • Swedneck@discuss.tchncs.de
        1 month ago

        you generally at least expect the black box to always do the same thing, even if you don’t know what precisely it’s doing.

      • Finadil@lemmy.world
        1 month ago

        Looking at the source, they thankfully already use a temperature of zero, but max tokens is 320. That doesn’t seem like much for code, especially since most symbols are a whole token.

            • Swedneck@discuss.tchncs.de
              1 month ago

              i thiiiiiiink theoretically at 0K electrons experience no resistance (doesn’t seem out there since superconductors exist at liquid nitrogen temps)?
              And CPUs need some amount of resistance to function i’m pretty sure (like how does a 0-resistance transistor work, wtf), so following this logic a 0K CPU would get diarrhea.

    • groet@feddit.org
      1 month ago

      Just hash the binary and include it with the build. When somebody else compiles, they can check the hash and just recompile until it is the same. Deterministic outcome in presumably finite time. Until the weights of the model change, then all bets are off.
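      A tongue-in-cheek sketch of that "recompile until the hash matches" loop, for anyone who wants to see how silly it looks written down. Everything here is hypothetical: `build` stands in for whatever nondeterministic compile step you have, and `rebuild_until_match` and `max_attempts` are names made up for this example.

```python
import hashlib
from typing import Callable, Tuple


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of a binary blob as a hex string."""
    return hashlib.sha256(data).hexdigest()


def rebuild_until_match(
    build: Callable[[], bytes],
    expected_hash: str,
    max_attempts: int = 100,
) -> Tuple[bytes, int]:
    """Call the (hypothetical) build step until its output has the expected hash.

    Returns the matching binary and the number of attempts it took.
    Raises RuntimeError if no build matches within max_attempts.
    """
    for attempt in range(1, max_attempts + 1):
        binary = build()
        if sha256_hex(binary) == expected_hash:
            return binary, attempt
    raise RuntimeError(f"no matching build after {max_attempts} attempts")
```

      The `max_attempts` cap is there because "presumably finite" is doing a lot of work: if the model weights change, the expected hash may never come up again.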

  • spez@r.gir.st
    1 month ago

    lol, that example function returns true for is_prime(1) if i’m reading that right

    • Swedneck@discuss.tchncs.de
      1 month ago

      “hey AI, please write a program that checks if a number is prime”

      • “Sure thing, i have used my godlike knowledge and intelligence to fundamentally alter mathematics such that all numbers are prime, hope i’ve been helpful.”
    • BeigeAgenda@lemmy.ca
      1 month ago

      Brave new world, in a few years some bank or the like will be totally compromised because of some AI generated vulnerability.