INDEX
Explanations
strong emotional reactions or opinions expressed in text
verbs related to negative actions or complaints
New Auto-Interp
Negative Logits
nai
-0.68
Bei
-0.64
Bonds
-0.64
peria
-0.63
Asia
-0.63
ortium
-0.63
é¾įå
-0.63
Suc
-0.61
Timeline
-0.60
WIN
-0.58
POSITIVE LOGITS
essional
0.75
igious
0.75
untled
0.75
backer
0.71
sed
0.69
kefeller
0.68
quit
0.66
anks
0.64
rolled
0.64
Edited
0.64
Activations Density 0.039%