• Frezik
    link
    fedilink
    English
    22 months ago

    FWIW, it doesn’t work. The preprocessing for LLM training isn’t going to be fooled by that. It’s just making things harder for everyone to read.

      • Frezik
        link
        fedilink
        English
        32 months ago

        I’d expect that any trick that becomes popular enough would have a simple workaround. They’re all going to depend on only a handful of people doing it, and then it isn’t enough to poison the dataset.