INDEX
Explanations
geographical features and locations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1520
+0.13
0.4%
1379
+0.11
0.3%
690
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1379
+0.13
0.06
1305
+0.11
0.06
222
+0.09
0.06
Negative Logits
was
-0.70
wasn
-0.69
Was
-0.66
was
-0.63
Was
-0.61
wasn
-0.55
WAS
-0.55
cista
-0.55
Orleans
-0.50
earlier
-0.49
POSITIVE LOGITS
maneu
1.13
chrétien
1.05
embodi
1.04
shenan
1.03
reluct
1.02
!...
1.02
suscep
1.01
scrat
1.00
?...
0.99
impra
0.99
Activations Density 0.663%