INDEX
    Explanations

    words written using characters from different languages or possibly corrupted text

    symbols or characters related to formatting or encoding issues

    Negative Logits
     Mandela        -0.64
    ifying          -0.64
     cloaked        -0.63
    icity           -0.62
     LIFE           -0.62
     Perkins        -0.62
     Dragonbound    -0.61
    ification       -0.61
    "]=>            -0.60
     confounding    -0.59
    Positive Logits
    ĥ     2.17
    Ĺ     1.97
    ħ     1.88
    Ľ     1.81
    ī     1.81
    Ļ     1.77
    ı     1.76
    Ŀ     1.75
    ij    1.75
    Į     1.72
    Act Density 0.024%

    No Known Activations