INDEX
Explanations
intensifiers or adverbs that amplify meaning or emotion
New Auto-Interp
Negative Logits
swer
-0.16
fusc
-0.16
ARGS
-0.14
ulumi
-0.14
anmar
-0.14
iddy
-0.14
urous
-0.14
lopedia
-0.14
soever
-0.14
ÏĢÏīÏĤ
-0.13
POSITIVE LOGITS
aza
0.15
Sys
0.14
coincidence
0.14
жÑĥ
0.14
_like
0.14
Moor
0.14
tempted
0.13
etus
0.13
izin
0.13
ify
0.13
Activations Density 0.169%