INDEX
    Explanations

    instances of the word "mistake."

    New Auto-Interp
    Negative Logits
    DoubleQuotes
    -0.67
    OGND
    -0.61
     الحره
    -0.61
     OnInit
    -0.60
     Photographie
    -0.59
     Ceramby
    -0.57
    ंदीखरीदारी
    -0.53
    uerung
    -0.53
     rings
    -0.53
    География
    -0.53
    POSITIVE LOGITS
     Emb
    0.66
    mix
    0.66
    Emb
    0.65
    [][]
    0.65
     emb
    0.65
    Mix
    0.65
     Fr
    0.63
     Mix
    0.60
    <?=$
    0.60
     acciaio
    0.60
    Act Density 0.133%

    No Known Activations