INDEX
Explanations
proper nouns of people related to technology and politics
mentions of specific individuals or entities, particularly those with the initial 'J.'
New Auto-Interp
Negative Logits
Anthem
-0.60
Fas
-0.58
whistle
-0.58
Nunes
-0.56
foremost
-0.56
DRAG
-0.53
shed
-0.52
Camer
-0.52
arom
-0.52
torch
-0.51
POSITIVE LOGITS
ilee
0.97
neys
0.97
usalem
0.91
iffe
0.88
ensen
0.85
unal
0.78
okia
0.77
Whedon
0.76
etus
0.76
ernaut
0.75
Activations Density 0.090%