Explanations
terms related to empires and imperial structures
Negative Logits
Token       Logit
gat         -0.72
LEncoder    -0.72
:]:         -0.71
Philist     -0.70
Dmit        -0.66
bolistas    -0.66
Thom        -0.66
Wit         -0.65
Laus        -0.63
<!--[       -0.63
Positive Logits
Token       Logit
Empire      1.16
Empire      1.14
EMPIRE      1.13
empire      1.04
empires     1.02
Empires     0.99
Imperio     0.94
emperors    0.93
empire      0.89
empereur    0.88
Activations Density: 0.007%