INDEX
Explanations
instances of the word "compared."
Negative Logits
Token      Logit
rite       -0.67
i          -0.63
o          -0.61
ст         -0.59
zy         -0.58
Bbb        -0.57
y          -0.56
purpose    -0.55
Man        -0.55
Plan       -0.55
Positive Logits
Token           Logit
compared        2.05
compared        1.91
Compared        1.81
Compared        1.72
rispetto        1.23
dibandingkan    1.16
compares        1.14
jäm             1.13
сравнению       1.10
Мексичка        1.09
Activations Density 0.065%