Comment on Flaw in Git bloated Microsoft repository by a factor of 35.
breadsmasher@lemmy.world 2 weeks agoThe culprit: name-hash collisions. With files like changelog.md and changelog.json, Git found all kinds of differences with each commit even though there often were none, which gradually added 173GB of unnecessary bloat. However, the changelogs were in completely different packages. The algorithm checked only the last 16 characters of the path, so package differences were not considered.
drre@feddit.org 2 weeks ago
thanks for the reply, but i think i got that. from the linked article:
but wouldn’t these erroneous diffs not show up in
git diff
? it seems that they were pushing (maybe automatically?)without inspecting the diffs first