INDEX
Explanations
references to geopolitical entities, particularly the term "Republic"
mentions of "Republic" in various contexts
New Auto-Interp
Negative Logits
Oracle
-0.77
balls
-0.74
Cue
-0.68
Bird
-0.63
TERN
-0.62
berger
-0.61
Panther
-0.59
Predator
-0.59
jay
-0.58
lder
-0.58
POSITIVE LOGITS
Republic
1.01
rats
0.99
Republic
0.96
oslov
0.95
ation
0.94
ans
0.91
onia
0.90
republic
0.89
arat
0.89
acies
0.88
Activations Density 0.029%