INDEX
Explanations
references to historical themes and battles in games
New Auto-Interp
Negative Logits
antan
-0.17
oose
-0.17
iaux
-0.15
Crosby
-0.15
olan
-0.15
ož
-0.14
Passport
-0.14
zp
-0.14
PJ
-0.14
NES
-0.14
POSITIVE LOGITS
Space
0.25
Tau
0.24
Nur
0.22
Chapters
0.22
Daemon
0.21
daemon
0.21
Warp
0.21
psy
0.21
Eld
0.20
Tau
0.20
Activations Density 0.010%