Explanations
intensifiers or modifiers that emphasize degree or quantity
Negative Logits
éal      -0.16
ritz     -0.15
Ïģιν     -0.15
,eg      -0.14
ieux     -0.14
linger   -0.14
ÏĦεÏĤ    -0.14
iliki    -0.14
ë¬       -0.14
igua     -0.13
Positive Logits
Cornel   0.18
tro      0.16
many     0.16
üm       0.15
ŀ        0.15
other    0.15
ther     0.15
_        0.14
many     0.14
Verm     0.14
Activations Density: 0.050%