Comment

Comment on Stack Overflow in freefall: 78 percent drop in number of questions

nutsack@lemmy.dbzer0.com ⁨2⁩ ⁨months⁩ ago

I’ve posted questions, but I don’t usually need to because someone else is almost always posted it before

source

Sort:hotnew top

rumba@lemmy.zip ⁨2⁩ ⁨months⁩ ago
Works well for now. Wait until there’s something new that it hasn’t been trained on. It needs that Stack Exchange data to train on.

source
- nutsack@lemmy.dbzer0.com ⁨2⁩ ⁨months⁩ ago
  Yes, I think this will create a problem. new things won’t be created very often because there will be a barrier of training corporate controlled AI to learn them
  
  source
- cherrari@feddit.org ⁨2⁩ ⁨months⁩ ago
  I don’t think so. All AI needs now is formal specs of some technical subject, not even human readable docs, let alone translations to other languages. In some ways, this is really beautiful.
  
  source
  - SoftestSapphic@lemmy.world ⁨2⁩ ⁨months⁩ ago
    LolnAI can’t do a single thing without humans who have already done it hundreds of thousands of times feeding it their data
    
    source
    okmko@lemmy.world ⁨2⁩ ⁨months⁩ ago
    I used to push back but now I just ignore it when people think that these models have cognition because companies have pushed so hard to call it AI.
    
    source
  - 123@programming.dev ⁨2⁩ ⁨months⁩ ago
    Technical specs don’t capture the bugs, edge cases and workarounds needed for technical subjects like software.
    
    source
    cherrari@feddit.org ⁨2⁩ ⁨months⁩ ago
    I can only speak for myself obviously, and my context here is some very recent and very extensive experience of applying AI to some new software developed internally in the org where I participate. So far, AI eliminated any need for any kind of assistance with understanding and it was definitely not trained on these particular software, obviously. Hard to imagine why I’d ever go to SO to ask questions about this software, even if I could. And if it works so well on such a tiny edge case, I can’t imagine it will do a bad job on something used at scale.
    
    source
    -> View More Comments
  - rumba@lemmy.zip ⁨2⁩ ⁨months⁩ ago
    It can’t handle things it’s not trained on very well, or at least not anything substantially different from what it was trained on.
    
    It can usually apply rules it’s trained on to a small corpus of data in its training data. Give me a list of female YA authors. But when you ask it for something more general (how many R’s there are in certain words) it often fails.
    
    source
    webadict@lemmy.world ⁨2⁩ ⁨months⁩ ago
    Actually, the Rs issue is funny because it WAS trained on that exact information which is why it says strawberry has two Rs, so it’s actually more proof that it only knows what it has been given data on. The thing is, when people misspelled strawberry as “strawbery”, then naturally, people respond, " Strawberry has two Rs." The problem is that LLM learning has no concept of context because it isn’t learning anything. The reinforcement mechanism is what the majority of its data tells it. It regurgitates that strawberry has two Rs because it has been reinforced by its dataset.
    
    source
    -> View More Comments
  - skisnow@lemmy.ca ⁨2⁩ ⁨months⁩ ago
    The whole point of StackExchange is that it contained everything that isn’t in the docs.
    
    source
GamingChairModel@lemmy.world ⁨2⁩ ⁨months⁩ ago
The hot concept around the late 2000’s and early 2010’s was crowdsourcing: leveraging the expertise of volunteers to build consensus. Quora, Stack Overflow, Reddit, and similar sites came up in that time frame where people would freely lend their expertise on a platform because that platform had a pretty good rule set for encouraging that kind of collaboration and consensus building.

Monetizing that goodwill didn’t just ruin the look and feel of the sites: it permanently altered people’s willingness to participate in those communities. Some, of course, don’t mind contributing. But many do choose to sit things out when they see the whole arrangement as enriching an undeserving middleman.

source
- rumba@lemmy.zip ⁨2⁩ ⁨months⁩ ago
  Probably explains why quora started sending me multiple daily emails about shit i didn’t care about and removed unsubscribe buttons form the emails.
  
  I don’t delete many accounts… but that was one of them
  
  source
Gsus4@mander.xyz ⁨2⁩ ⁨months⁩ ago
What we’re all afraid is that cheap slop is going to make stack broke/close/bought/private and then it will be removed from the public domain…then jack up the price of slop when the alternative is gone…

source
- NikkiDimes@lemmy.world ⁨2⁩ ⁨months⁩ ago
  I do wonder then, as new languages and tools are developed, how quickly will AI models be able to parrot information on their use, if sources like stackoverflow cease to exist.
  
  source
  - Gsus4@mander.xyz ⁨2⁩ ⁨months⁩ ago
    I think this is a classic of privatization of commons, so that nobody can compete with them later without free public datasets…
    
    source
  - rumba@lemmy.zip ⁨2⁩ ⁨months⁩ ago
    It’ll certainly be of lesser quality even if they go through steps to make it able to address it.
    
    good documentation and open projects ported might be enough to give you working code, but it’s not going to be able to optimize it without being trained on tons of optimization data.
    
    source
- falseWhite@lemmy.world ⁨2⁩ ⁨months⁩ ago
  There are free open source models you can run locally and they have all the answers.
  
  source
  - Gsus4@mander.xyz ⁨2⁩ ⁨months⁩ ago
    But can you train on them? What happens to the original dataset
    
    source
    falseWhite@lemmy.world ⁨2⁩ ⁨months⁩ ago
    There are open weight models that users can download and run locally. Because the weights are open, they can be customised and fine tuned.
    
    And then there are fully open source models, that publish everything, the model with open weights, the training source code, as well as the full training dataset.
    
    source