    Explanations

    words containing special characters like "ı" and "ÅŁ"

    repeated characters or letters in a specific context

    Negative Logits
    Token                     Logit
    " Appalach"               -0.84
    "arsity"                  -0.72
    " Sussex"                 -0.70
    " guiActiveUnfocused"     -0.69
    " Spartan"                -0.67
    "Buyable"                 -0.66
    " Willow"                 -0.66
    " HMS"                    -0.66
    "maxwell"                 -0.65
    " Indigo"                 -0.65
    Positive Logits
    Token                     Logit
    "ı"                        1.06
                               1.01
    "Ì"                        1.01
    "oÄŁ"                      0.99
    "·"                        0.97
    "ÅŁ"                       0.95
    "¾"                        0.94
    "Ķ"                        0.88
    "ĥ"                        0.87
    "Ĩ"                        0.86
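
The page does not say how the logit values above were computed. A common convention in feature dashboards is to project the feature's output direction through the model's unembedding matrix and list the tokens with the most negative and most positive results. The sketch below illustrates that convention on random toy data. The function name `top_logit_effects`, the array shapes, and the placeholder vocabulary are all assumptions made for illustration and are not taken from this page.

```python
import numpy as np


def top_logit_effects(direction, unembed, vocab, k=10):
    """Return the k most negative and k most positive (token, logit) pairs.

    direction: (d_model,) hypothetical feature output direction
    unembed:   (d_model, vocab_size) hypothetical unembedding matrix
    vocab:     list of vocab_size token strings
    """
    effects = direction @ unembed          # one logit effect per token
    order = np.argsort(effects)            # ascending: most negative first
    negative = [(vocab[i], float(effects[i])) for i in order[:k]]
    positive = [(vocab[i], float(effects[i])) for i in order[::-1][:k]]
    return negative, positive


# Toy data only: random values standing in for real model weights.
rng = np.random.default_rng(0)
d_model, vocab_size = 8, 50
vocab = [f"tok{i}" for i in range(vocab_size)]
direction = rng.normal(size=d_model)
unembed = rng.normal(size=(d_model, vocab_size))

negative, positive = top_logit_effects(direction, unembed, vocab, k=10)
for token, value in negative:
    print(f"{token!r:12} {value:6.2f}")
for token, value in positive:
    print(f"{token!r:12} {value:6.2f}")
```

Under this convention the negative list is ordered from most to least negative and the positive list from most to least positive, which matches the ordering of the two lists above.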
    Activation Density 0.010%

    No Known Activations