INDEX
Explanations
mentions of public figures giving speeches or statements in the news
instances of the word "spoke" indicating dialogue or speech by various individuals
New Auto-Interp
Negative Logits
aredevil
-0.72
rypt
-0.70
ILCS
-0.66
mental
-0.66
rawler
-0.66
anny
-0.65
assi
-0.64
brance
-0.64
ween
-0.62
involved
-0.62
POSITIVE LOGITS
volumes
0.87
aloud
0.86
omin
0.77
transcripts
0.76
harshly
0.72
onstage
0.72
louder
0.70
passionately
0.70
glow
0.70
mute
0.69
Activations Density 0.029%