INDEX
Explanations
comparative adjectives related to size and strength
expressions relating to comparisons and improvements
New Auto-Interp
Negative Logits
¬¼
-0.76
pedia
-0.76
afety
-0.71
anka
-0.66
igslist
-0.65
oyd
-0.65
uta
-0.64
rets
-0.62
mberg
-0.62
senal
-0.61
POSITIVE LOGITS
than
1.49
clearer
1.28
wiser
1.22
harsher
1.17
stronger
1.17
than
1.16
quicker
1.15
richer
1.14
faster
1.13
heavier
1.12
Activations Density 0.232%