INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     safe
    -0.06
    idepress
    -0.06
    ecký
    -0.06
     mekan
    -0.06
    obot
    -0.06
     zw
    -0.06
    division
    -0.06
     goo
    -0.06
     contradictory
    -0.06
     woke
    -0.06
    POSITIVE LOGITS
     talent
    0.13
     talents
    0.11
     talented
    0.09
     Talent
    0.09
    人気
    0.09
     genius
    0.08
     Tarif
    0.08
     inventory
    0.07
    0.07
    лет
    0.07
    Act Density 0.005%

    No Known Activations