Explanations
historical and political terms related to power struggles
references to historical events and figures
Negative Logits
moms          -0.87
FOX           -0.86
neuroscience  -0.85
CNN           -0.81
VIDEO         -0.81
oother        -0.81
Kids          -0.80
VIDE          -0.80
Marketplace   -0.80
Trend         -0.79
Positive Logits
besie         1.21
Napoleon      1.17
Chamberlain   1.17
Empress       1.16
treacher      1.14
treason       1.14
decree        1.13
revolt        1.11
Emperor       1.10
assassinated  1.10
Activations Density: 0.536%