INDEX
Explanations
comparative adjectives describing size, age, or quality
New Auto-Interp
Negative Logits
dest
-0.15
iego
-0.15
owel
-0.14
ign
-0.14
y
-0.14
recip
-0.14
ÄĻ
-0.14
erk
-0.14
likeness
-0.14
eman
-0.14
POSITIVE LOGITS
portions
0.16
ones
0.16
(lower
0.16
Ones
0.16
-than
0.15
niż
0.15
поба
0.15
versions
0.14
LT
0.14
acters
0.14
Activations Density 0.092%