Explanations
references to historical empires, particularly the "Empire"
mentions of "Empire" and related terms
Negative Logits
Token       Value
paces       -0.78
matter      -0.76
yrics       -0.75
eret        -0.73
ociate      -0.73
hift        -0.73
onto        -0.72
giving      -0.71
arten       -0.70
electric    -0.70
Positive Logits
Token       Value
Strikes     1.00
perors      0.84
conquered   0.81
Augustus    0.81
Claud       0.81
Britann     0.80
Builder     0.80
Nero        0.77
Napoleon    0.75
overth      0.75
Activations Density: 0.071%