Explanations
references to historical events and themes
Negative Logits
Token       Logit
chwitz      -0.16
Tribal      -0.16
slideUp     -0.15
éijij       -0.15
urette      -0.15
polator     -0.15
ovel        -0.15
orro        -0.15
arpa        -0.14
ivant       -0.14
Positive Logits
Token         Logit
histor        0.18
labour        0.16
elites        0.15
Registers     0.15
asters        0.14
reass         0.14
aters         0.14
masculinity   0.14
æľŁ           0.14
133           0.14
Activations Density: 0.106%