INDEX
Explanations
references to historical artifacts and their significance
New Auto-Interp
Negative Logits
native
-0.16
tank
-0.15
輪
-0.14
Rim
-0.14
ura
-0.14
ืà¸Ńà¸Ķ
-0.14
Schumer
-0.14
Ragnar
-0.14
AreaView
-0.14
OLOR
-0.14
POSITIVE LOGITS
Homo
0.27
Austral
0.24
idelberg
0.24
habil
0.22
ustral
0.21
Denis
0.21
upright
0.21
Erect
0.19
Paleo
0.19
ensis
0.19
Activations Density 0.017%