INDEX
Explanations
statements made by individuals
instances of direct speech or quotations
Negative Logits
pes          -0.95
folios       -0.90
arette       -0.79
qqa          -0.76
haar         -0.76
cffffcc      -0.75
osponsors    -0.75
olen         -0.74
Kin          -0.72
mb           -0.70
Positive Logits
goodbye      0.87
afterward    0.87
NBA          0.83
bluntly      0.79
ESPN         0.78
Blackhawks   0.77
STATS        0.76
offseason    0.76
Spartan      0.75
sarcast      0.75
Activations Density: 0.134%