INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     контроль
    -0.07
     sailor
    -0.07
    (priv
    -0.06
     ребенка
    -0.06
    ییر
    -0.06
     Wien
    -0.06
     їм
    -0.06
     Drivers
    -0.06
     편집
    -0.06
    ۱۰
    -0.06
    POSITIVE LOGITS
     statue
    0.17
     statues
    0.14
     Statue
    0.13
    :def
    0.08
    istles
    0.08
    Ε
    0.07
    .offsetTop
    0.07
     zaz
    0.07
    GE
    0.07
    atin
    0.07
    Act Density 0.003%

    No Known Activations