INDEX
Explanations
instances of the word "other" and its variations, indicating comparisons or distinctions
Negative Logits
Token      Logit
icle       -0.16
lain       -0.15
cken       -0.15
adaş       -0.14
swers      -0.14
ý          -0.13
rong       -0.13
ittest     -0.13
both       -0.13
rick       -0.13
Positive Logits
Token      Logit
-than      0.40
world      0.35
than       0.34
similar    0.32
similarly  0.32
wis        0.30
ewise      0.30
equally    0.28
-world     0.27
than       0.26
Activations Density: 0.113%