    Negative Logits
    Token      Logit
    "uk"       -1.52
    "ukas"     -1.19
    "UK"       -1.15
    "ukan"     -1.01
    " uk"      -0.99
    "uka"      -0.97
    " UK"      -0.94
    "uki"      -0.92
    "uker"     -0.91
    "ukk"      -0.86
    Positive Logits
    Token            Logit
    "ิลปะ"           0.49
    "Startup"        0.45
    " carburetor"    0.44
    "nocześnie"      0.43
    "abestanden"     0.43
    " armées"        0.43
    " parlor"        0.42
    "OV"             0.42
    " Startup"       0.42
    "sf"             0.42
    Activation Density: 0.016%

    No Known Activations