INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    afe
    0.86
    0.81
    но
    0.80
    에서도
    0.80
    0.77
    リズム
    0.77
     друго
    0.77
    取り付け
    0.76
    psie
    0.76
    aglia
    0.75
    POSITIVE LOGITS
     endow
    0.92
     headsets
    0.85
     gminy
    0.82
    ಥವಾ
    0.80
    groomed
    0.80
    сных
    0.79
     Glast
    0.78
    うる
    0.77
    ья
    0.77
    0.77
    Act Density 0.000%

    No Known Activations