INDEX
Explanations
quotation marks followed by emotional or assertive statements
quotations or dialogue in the text
Negative Logits
etheless     -0.92
eers         -0.78
erity        -0.75
pec          -0.74
inarily      -0.72
adem         -0.71
Ĥª           -0.70
atical       -0.68
itionally    -0.68
redit        -0.68
Positive Logits
said         1.00
said         0.99
joked        0.97
reads        0.96
says         0.92
exclaimed    0.89
tweeted      0.89
replied      0.89
recalls      0.85
remembers    0.85
Activations Density 0.095%