Do you have a source for this claim? I see this report by Google and MIT Tech Review that says image/video generation does use a lot of energy compared to text generation.
Taking the data from those articles, we get this table:
| AI Activity | Source | Energy Use (per prompt) | Everyday Comparison | 
|---|---|---|---|
| Median Gemini Text Prompt | Google Report | 0.24 Wh | Less energy than watching a 100W TV for 9 seconds. | 
| High-Quality AI Image | MIT Article | ~1.22 Wh | Running a standard microwave for about 4 seconds. | 
| Complex AI Text Query | MIT Article | ~1.86 Wh | Roughly equivalent to charging a pair of wireless earbuds for 2-3 minutes. | 
| Single AI Video (5-sec) | MIT Article | ~944 Wh (0.94 kWh) | Nearly the same energy as running a full, energy-efficient dishwasher cycle. | 
| "Daily AI Habit" | MIT Article | ~2,900 Wh (2.9 kWh) | A bit more than an average US refrigerator consumes in a full 24-hour period. | 
MangoCats@feddit.it 1 month ago
Another way of looking at this: A “Daily AI Habit” on your table is about the same as driving a Tesla 10 miles…