INDEX
Explanations
negations and phrases related to limitations or contradictions
New Auto-Interp
Negative Logits
olia
-0.14
asley
-0.13
apus
-0.13
arger
-0.13
ountains
-0.13
còn
-0.13
åĩ¡
-0.13
)(((
-0.13
MMdd
-0.13
анÑģи
-0.13
POSITIVE LOGITS
either
1.62
either
1.38
Either
1.35
EITHER
1.29
Either
1.26
либо
0.77
soit
0.68
ither
0.68
neither
0.63
ITHER
0.60
Activations Density 0.411%