INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    çylyk
    -0.08
    цать
    -0.08
    chezo
    -0.08
    hew
    -0.08
    ۱۳۹
    -0.08
    ván
    -0.08
    C
    -0.08
    цов
    -0.08
    -ekwu
    -0.08
     pensé
    -0.08
    POSITIVE LOGITS
    Is
    0.16
     Is
    0.14
    	Is
    0.12
    _is
    0.11
    _Is
    0.11
    (Is
    0.10
    -is
    0.10
    is
    0.10
    RI
    0.09
    .Is
    0.09
    Act Density 0.001%

    No Known Activations