Comment on Pdf to odt/docx conversion has me weeping!
mesamunefire@piefed.social 1 week agoAs a dev the reason pdf is so strange is because it's a compound format. It can be just images strung together. It can also be pure text with fonts, ect...etc ..
If you open the file as a text file, you can see this. It's many different formats in a trenchcoat.
Botzo@lemmy.world 1 week ago
Yeah, also a dev here. I’d be so happy if they’d parted ways with the 90s legacy bits at some point. Just glad there are enough parsing libraries that I’ll never need to care (right? Please tell me I’m right!).
mesamunefire@piefed.social 6 days ago
I hope your right too lol.