INDEX
Explanations
intensifiers, particularly the word "really," indicating strong emphasis or enthusiasm
New Auto-Interp
Negative Logits
raz
-0.16
olum
-0.15
ÙĨÙĩ
-0.14
elight
-0.14
isContained
-0.14
absol
-0.14
osa
-0.13
eya
-0.13
osas
-0.13
ãĤĪãĤĬ
-0.13
POSITIVE LOGITS
,re
0.18
-high
0.18
nice
0.18
anol
0.17
important
0.17
obvious
0.16
easy
0.16
nicely
0.16
Heller
0.16
altar
0.16
Activations Density 0.031%