Explanations
words and phrases that express concern and emotional impact
Negative Logits
ãĥ¼ãĥĨãĤ£    -0.58
riched       -0.57
roots        -0.54
©¶æ          -0.53
pione        -0.53
anqu         -0.52
ospital      -0.52
aughtered    -0.52
idal         -0.52
ideon        -0.51
Positive Logits
:)           1.28
;)           1.25
haha         1.20
ðŁĻĤ         1.19
:-)          1.16
ðŁĺ          1.03
lol          1.01
:(           1.01
tho          0.98
anyways      0.93
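
Several tokens in the lists above (for example ãĥ¼ãĥĨãĤ£ and ðŁĻĤ) look like mojibake but are probably raw vocabulary entries from a byte-level BPE tokenizer. The sketch below assumes the GPT-2 style byte-to-character table; the source does not say which tokenizer produced these tokens. The helper name `decode_token` is hypothetical. Under that assumption, ãĥ¼ãĥĨãĤ£ decodes to the katakana fragment ーティ and ðŁĻĤ to the emoji 🙂. Tokens such as ©¶æ and ðŁĺ are incomplete UTF-8 sequences and cannot be fully decoded on their own.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable-character table (assumed tokenizer)."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to code point 256 + n.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map a displayed token back to bytes, then decode as UTF-8.

    Incomplete byte sequences become U+FFFD rather than raising.
    A character outside the table (e.g. a literal space) raises KeyError.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥ¼ãĥĨãĤ£", "ðŁĻĤ", "ðŁĺ", "©¶æ", "riched"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Plain ASCII tokens such as `riched` or `:)` pass through unchanged, so the helper can be applied to the whole list.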
Activations Density 0.520%
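
The source does not define how the density figure is computed. A common reading, assumed here, is the fraction of sampled tokens on which the feature's activation is above zero. The sketch below illustrates that reading with made-up activations; the function name `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations):
    """Fraction of tokens whose activation is strictly positive (assumed definition)."""
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > 0)
    return fired / len(activations)


if __name__ == "__main__":
    # Hypothetical sample: 10,000 tokens, of which 52 activate the feature.
    sample = [0.0] * 9948 + [1.0] * 52
    print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.520% would mean the feature fires on roughly one token in every 190.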