Explanations
phrases indicating quantity or comparison
Negative Logits
very     -0.30
Very     -0.25
very     -0.22
Very     -0.22
很       -0.21
muito    -0.21
VERY     -0.19
quite    -0.19
molto    -0.18
more     -0.18
Positive Logits
tarde        0.23
importante   0.18
preci        0.17
vast         0.16
grande       0.16
κον          0.16
antig        0.15
alta         0.15
alto         0.15
_advanced    0.15
Activations Density 0.014%