INDEX
Explanations
statements from spokespersons or representatives
Instances of the word "said"
New Auto-Interp
Negative Logits
otin
-0.89
à¦
-0.84
=~=~
-0.77
ï¸
-0.69
advertising
-0.69
ILCS
-0.67
otion
-0.67
COR
-0.66
Holy
-0.66
âĸ
-0.65
POSITIVE LOGITS
doms
0.94
bluntly
0.79
anecd
0.79
afterward
0.77
afterwards
0.73
goodbye
0.71
sarcast
0.69
yesterday
0.67
earlier
0.65
heit
0.64
Activations Density 0.307%