Comment on AI Training Slop
jfrnz@lemm.ee 5 days ago
Running a 500W GPU 24/7 for a full year is less than a quarter of the energy consumed by the average automobile in the US (as of 2000). I don’t know how many GPUs this person has or how long it took to fine-tune the model, but it’s clearly not creating an ecological disaster. Please understand there is a huge difference between the power consumed by companies training cutting-edge models at massive scale and speed and that consumed by a locally deployed model doing only fine-tuning and inference.
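If you want to sanity-check that comparison, here is a rough back-of-envelope sketch in Python. The car figures (annual mileage, fuel economy, energy per gallon of gasoline) are illustrative assumptions on my part, not exact statistics:

```python
# Annual energy of a single 500 W GPU running 24/7.
gpu_kwh_per_year = 500 * 24 * 365 / 1000  # 4,380 kWh

# Assumed average US car (illustrative): ~11,500 miles/year at ~22 mpg,
# with gasoline holding ~33.7 kWh of energy per gallon.
car_kwh_per_year = 11_500 / 22 * 33.7     # ~17,600 kWh

print(f"GPU: {gpu_kwh_per_year:,.0f} kWh/yr")
print(f"Car: {car_kwh_per_year:,.0f} kWh/yr")
print(f"GPU/car ratio: {gpu_kwh_per_year / car_kwh_per_year:.0%}")  # ~25%
```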
utopiah@lemmy.world 5 days ago
I specifically asked about the training part, not the fine-tuning, but thanks for clarifying.
jfrnz@lemm.ee 5 days ago
The point is that OP (most probably) didn’t train it — they downloaded a pre-trained model and only did fine-tuning and inference.
utopiah@lemmy.world 5 days ago
Right, and that is exactly my point: by just downloading the model, OP might not realize what the training cost was. It might be low in some cases, but on average it is quite high, at least relative to fine-tuning or inference. So my question was precisely to highlight that running a model locally without knowing its training cost is naive, ecologically speaking. They did clarify that they don’t care, so that’s coherent for them.

I’m insisting on this point because others might think “Oh, I can run a model locally, so it’s not ‘evil’.” So I’m trying to clarify (and please let me know if I’m wrong) that local deployment is good for privacy, but the upfront training costs are not insignificant, and that might lead some people to prefer NOT relying on models that are very costly to train, choosing other models or even a totally different kind of solution instead.
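To give a rough sense of the scales involved, here is a sketch using the published estimate of roughly 1,287 MWh for training GPT-3 (Patterson et al., 2021) as one data point, plus the hypothetical single 500 W GPU from above; all of these are illustrative assumptions, not measurements of OP’s setup:

```python
# One-time training energy, using the GPT-3 estimate (~1,287 MWh) as an example.
training_kwh = 1_287_000

# Hypothetical local workloads on a single 500 W GPU.
finetune_kwh = 0.5 * 24 * 7        # one week of fine-tuning: 84 kWh
inference_kwh = 0.5 * 24 * 365     # a full year of 24/7 inference: 4,380 kWh

print(f"Training (one-time):        {training_kwh:>9,.0f} kWh")
print(f"Fine-tuning (one week):     {finetune_kwh:>9,.0f} kWh")
print(f"Inference (one year, 24/7): {inference_kwh:>9,.0f} kWh")

# The one-time training run is ~300x the yearly 24/7 local usage,
# though that upfront cost is shared across every downstream user.
print(f"Training / yearly inference: {training_kwh / inference_kwh:.0f}x")
```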
jfrnz@lemm.ee 5 days ago
The model already exists; abstaining from using it doesn’t make the energy consumption go away. I don’t think it’s reasonable to let sunk energy costs drive what you do, or else you would never touch a computer.