INDEX
    Explanations
    Negative Logits
    ibu
    -0.07
     strán
    -0.06
    	usage
    -0.06
     mouth
    -0.06
     klik
    -0.06
     TECH
    -0.06
    -mouth
    -0.06
     Slayer
    -0.06
    -0.06
     який
    -0.06
    Positive Logits
     Sommer
    0.07
    ớp
    0.06
     работа
    0.06
    bere
    0.06
     padre
    0.06
     내가
    0.06
    нями
    0.06
    แดง
    0.06
     مسجد
    0.06
     топ
    0.06
    Act Density 0.004%

    No Known Activations