Explanations
intensifiers or modifiers that express degree or extent
Negative Logits
Token                 Logit
qus                   -0.16
imore                 -0.16
ugo                   -0.15
ApplicationException  -0.15
uars                  -0.15
ìĨ                    -0.14
maf                   -0.13
AREST                 -0.13
Schwe                 -0.13
ãĤ·ãĤ¢                -0.13
Positive Logits
Token                 Logit
much                  0.60
much                  0.44
Much                  0.42
MUCH                  0.41
Much                  0.40
mucho                 0.33
many                  0.30
veel                  0.24
viel                  0.24
beaucoup              0.24
Activations Density 0.036%