INDEX
Explanations
references to historical figures and events
New Auto-Interp
Negative Logits
.Magic
-0.17
rade
-0.17
ropolis
-0.16
EMPLARY
-0.16
alta
-0.15
ãĤ¶ãĥ¼
-0.15
chner
-0.15
_magic
-0.15
ække
-0.15
kê
-0.14
POSITIVE LOGITS
Tud
0.33
Parliament
0.28
Henry
0.26
English
0.23
parliament
0.23
tud
0.23
Henry
0.22
Crom
0.22
Plant
0.21
England
0.20
Activations Density 0.053%