You put a lot of work into this.
Comment on Not for me, tho
tetris11@lemmy.ml 1 day ago
Which language provides the most random alphabetically sorted sequence?
Data
| Base | English | Dutch | German | Turkish | |------+---------+-------+--------+---------| | 1 | 8 | 8 | 8 | 6 | | 2 | 11 | 3 | 3 | 5 | | 3 | 5 | 1 | 1 | 1 | | 4 | 4 | 11 | 11 | 9 | | 5 | 9 | 9 | 5 | 4 | | 6 | 1 | 10 | 9 | 2 | | 7 | 7 | 12 | 6 | 10 | | 8 | 6 | 2 | 7 | 11 | | 9 | 10 | 4 | 4 | 12 | | 10 | 3 | 5 | 10 | 8 | | 11 | 12 | 6 | 2 | 3 | | 12 | 2 | 7 | 12 | 7 |
- Sourced from comments in thread
Plot with Pearson Score
Code
bash gnuplot -p -e ’ set xlabel “Base Sequence”; set ylabel “Alphabetic”; set xtics (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12); set ytics (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12); stats “alphabetic.tab” using 1:2 name “E”; stats “” using 1:3 name “D”; stats “” using 1:4 name “G”; stats “” using 1:5 name “T”; set label 1 sprintf(“Corr. [Eng = %.3f], [Dut = %.3f], [Ger = %.3f], [Tur = %.3f]”, E_correlation, D_correlation, G_correlation, T_correlation) at graph 0.95,0.05 right; plot “” using 1:2 with lines title columnhead(2), “” using 1:3 with lines title columnhead(3), “” using 1:4 with lines title columnhead(4), “” using 1:5 with lines title columnhead(5) ’
It looks like the most random language is Dutch (closest to zero), and Turkish appears to be the least random (probably the 10,11,12 sequence skewed it)
jaybone@lemmy.zip 1 day ago
FeatherConstrictor@sh.itjust.works 1 day ago
Thank you for doing and sharing this
null@slrpnk.net 1 day ago
This is the second comment I’ve seen like this from you.
Please never stop.
diemartin@sh.itjust.works 1 day ago
I didn’t expect soneone to put that much effort into it.
Thanks! This is awesome!
Resonosity@lemmy.dbzer0.com 13 hours ago
c/dataisbeautiful