INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    compet
    -0.09
    -0.08
    цій
    -0.07
    -0.07
    conn
    -0.07
    controller
    -0.07
    exper
    -0.07
     مفت
    -0.07
     corso
    -0.07
    CONS
    -0.07
    POSITIVE LOGITS
    ינות
    0.08
     princip
    0.08
     voces
    0.08
     занятия
    0.07
    /Re
    0.07
     fürs
    0.07
    0.07
    ივ
    0.07
    -Re
    0.07
    .Kind
    0.07
    Act Density 0.001%

    No Known Activations