INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ка
    2.22
    ون
    1.82
    1.70
    matic
    1.66
    punkte
    1.65
    رش
    1.64
    ার
    1.58
    א
    1.57
    го
    1.55
    1.55
    POSITIVE LOGITS
    َ
    1.99
     negeri
    1.98
    aj
    1.90
    στε
    1.83
    om
    1.80
    ut
    1.73
    utie
    1.72
    ERTS
    1.72
    1.72
     grabando
    1.66
    Act Density 0.015%

    No Known Activations