💡𝚂𝗆𝖺𝗋𝗍𝗆𝖺𝗇 𝙰𝗉𝗉𝗌📱 to Programming@programming.devEnglish • 20 hours agoDo not Interrupt Developers, Study Saysshiftmag.devexternal-linkmessage-square54fedilinkarrow-up1289arrow-down12
arrow-up1287arrow-down1external-linkDo not Interrupt Developers, Study Saysshiftmag.dev💡𝚂𝗆𝖺𝗋𝗍𝗆𝖺𝗇 𝙰𝗉𝗉𝗌📱 to Programming@programming.devEnglish • 20 hours agomessage-square54fedilink
minus-squareFreziklinkfedilinkEnglish1•7 hours agoFWIW, it doesn’t work. The preprocessing for LLM training isn’t going to be fooled by that. It’s just making things harder for everyone to read.
minus-squareFreziklinkfedilinkEnglish2•5 hours agoI’d expect that any trick that becomes popular enough would have a simple workaround. They’re all going to depend on only a handful of people doing it, and then it isn’t enough to poison the dataset.
FWIW, it doesn’t work. The preprocessing for LLM training isn’t going to be fooled by that. It’s just making things harder for everyone to read.
Hmm, seriously? Does it also ignore zalgo text?
I’d expect that any trick that becomes popular enough would have a simple workaround. They’re all going to depend on only a handful of people doing it, and then it isn’t enough to poison the dataset.