• AmbitiousProcess (they/them)@piefed.social
    2 days ago

    Most AI models at this point won’t see significant gains from training on such a small sample of code.

    You don’t need a whole corporation’s code to make a functional model, you need the whole world’s.

    Adding a tiny bit of your own company’s code to the mix doesn’t really change the model much, so they generally won’t do it. Training costs a ton, and the only benefit is that the model ends up very, very slightly fine-tuned to kinda sorta produce code that’s maybe a little more stylistically similar to yours.

    • treadful@lemmy.zipOP
      2 days ago

      We’re talking about huge companies with unfathomably huge codebases written by tens of thousands of people. They control significant chunks of the world’s code. It would be stupid not to at least include it in an internal model.

      • fubbernuckin@lemmy.dbzer0.com
        1 day ago

        As big as some individual corporations are, the world (including every other massive corporation) is much bigger.