INDEX
Explanations
references to political figures or politically related terms
instances of the term "political" and its variations
New Auto-Interp
Negative Logits
Carbuncle
-0.74
Warrant
-0.72
urat
-0.70
Ved
-0.69
pity
-0.68
warrants
-0.67
Gutenberg
-0.66
recall
-0.65
Blazing
-0.65
Norn
-0.65
POSITIVE LOGITS
icians
1.64
ician
1.51
ifact
1.31
ically
1.20
eness
1.16
icial
1.12
ico
1.11
ique
1.03
icans
1.02
icking
1.01
Activations Density 0.036%