INDEX
Explanations
references to Native American terms or imagery
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1896
+0.20
0.8%
1983
+0.15
0.6%
1272
+0.15
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1896
+0.20
0.04
1141
+0.15
0.04
981
+0.15
0.05
Negative Logits
kuku
-0.58
wani
-0.52
kuti
-0.51
ħħ
-0.50
jati
-0.49
l
-0.48
Crosse
-0.48
habhar
-0.47
Sush
-0.47
kuli
-0.47
POSITIVE LOGITS
coar
1.04
uncin
0.96
overla
0.95
effe
0.94
dispen
0.94
suspic
0.91
igno
0.90
robus
0.87
embra
0.87
bayern
0.86
Activations Density 0.230%