    Negative Logits
     бюджет
    -0.08
    ithio
    -0.08
     Kari
    -0.08
     niche
    -0.08
     рест
    -0.08
     Lösung
    -0.08
    nega
    -0.08
    курс
    -0.07
     boda
    -0.07
     karaoke
    -0.07
    Positive Logits
     epistem
    0.10
     acquis
    0.09
    alde
    0.09
     edin
    0.09
    0.08
     sensory
    0.08
     measurements
    0.08
     aprendizado
    0.08
     сын
    0.08
    주의
    0.08
    Activation Density 0.005%

    No Known Activations