INDEX
Explanations
comparative phrases describing increasing or worsening intensity
Negative Logits
¬¼       -0.84
pedia    -0.71
anka     -0.70
afety    -0.68
uta      -0.64
senal    -0.63
ctuary   -0.63
oyd      -0.61
acly     -0.60
ORPG     -0.60
Positive Logits
than        1.59
quicker     1.30
clearer     1.29
faster      1.26
wiser       1.26
richer      1.24
stronger    1.23
slower      1.22
healthier   1.22
than        1.22
Activations Density 0.280%