Comment on This one goes out to the sysadmins in the crowd.

<- View Parent
stevecrox@kbin.social ⁨9⁩ ⁨months⁩ ago

This is why Java rocks with ETL, the language is built to access files via input/output streams.

It means you don't need to download a local copy of a file, you can drop it into a data lake (S3, HDFS, etc..) and pass around a URI reference.

Considering the size of Large Language Models I really am surprised at how poor streaming is handled within Python.

source
Sort:hotnewtop