INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    (New
    -0.08
    Wrap
    -0.07
    .Equal
    -0.07
     wspóln
    -0.07
    -0.07
    -indent
    -0.06
     المجال
    -0.06
    做过
    -0.06
    西班
    -0.06
    .bg
    -0.06
    POSITIVE LOGITS
    ouv
    0.08
     )]↵
    0.06
     Ellie
    0.06
    ailand
    0.06
     Security
    0.06
    ))+
    0.06
    0.06
     ולה
    0.06
    呼び
    0.06
    0.06
    Act Density 0.105%

    No Known Activations