    NEGATIVE LOGITS
    é     0.90
    λ     0.90
    ib    0.86
    @     0.85
    ase   0.83
    FS    0.82
    arf   0.82
    IE    0.81
    nf    0.80
    offs  0.80
    POSITIVE LOGITS
     Ча      1.14
     Това    0.96
     Ergebn  0.95
     Ту      0.94
     Ј       0.92
     Более   0.91
     После   0.91
     Фи      0.91
     Анали   0.90
     Он      0.89
    Act Density 0.000%

    No Known Activations