    NEGATIVE LOGITS
    Token                 Logit
    " Sergio"             -0.07
    (whitespace only)     -0.07
    "ugo"                 -0.07
    "ΑΠ"                  -0.07
    "irting"              -0.07
    " поба"               -0.06
                          -0.06
    " younger"            -0.06
    "amous"               -0.06
    " ignorant"           -0.06
    POSITIVE LOGITS
    Token                 Logit
    " chief"              0.10
    " Chief"              0.07
    "Chief"               0.07
    "toDouble"            0.06
    "'i"                  0.06
    " quicker"            0.06
    "็กชาย"               0.06
    "шев"                 0.06
    "/item"               0.06
    " accountability"     0.06
    Activation Density: 0.002%

    No Known Activations