INDEX
Explanations
The word "but" and its variants, used to highlight contrasting ideas or exceptions
Negative Logits
eniz          -0.07
Kaynak        -0.07
kıl           -0.07
åĽ            -0.07
åĨĨ           -0.07
lại           -0.07
enaire        -0.07
Ìģt           -0.07
oti           -0.07
.dump         -0.07
Positive Logits
otherwise     0.08
Anyway        0.08
nevertheless  0.07
Anyway        0.07
still         0.07
basically     0.07
Still         0.07
fine          0.07
nonetheless   0.06
Otherwise     0.06
Activations Density 0.037%