INDEX
Explanations
phrases indicating a source is mentioned or quoted
instances of the word "said."
New Auto-Interp
Negative Logits
à¦
-0.84
agra
-0.72
phal
-0.72
otion
-0.71
omore
-0.69
mol
-0.68
VIDEO
-0.68
ptives
-0.68
²¾
-0.68
ooter
-0.67
POSITIVE LOGITS
afterward
0.77
bluntly
0.71
doms
0.69
analysts
0.69
anecd
0.68
goodbye
0.67
sarcast
0.67
spokeswoman
0.65
afterwards
0.65
brisk
0.64
Activations Density 0.249%