INDEX
Explanations
adverbs that convey surprise or emphasis
New Auto-Interp
Negative Logits
ays
-0.15
åħĭæĸ¯
-0.14
åŀ
-0.14
lst
-0.14
Leban
-0.14
Hairst
-0.14
Bernardino
-0.13
wsz
-0.13
å¾³
-0.13
Monter
-0.13
POSITIVE LOGITS
forge
0.17
ÏĢη
0.16
lijk
0.16
iae
0.16
GGLE
0.15
ÏĩεδÏĮν
0.15
amble
0.15
Holl
0.15
omal
0.15
ably
0.15
Activations Density 0.066%