INDEX
Explanations
names of places and people
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.22
0.7%
227
+0.12
0.3%
764
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
227
+0.22
0.04
1343
+0.12
0.04
981
+0.10
0.04
Negative Logits
<bos>
-1.46
kháu
-0.95
fromnode
-0.94
kasarigan
-0.91
SharedDtor
-0.87
Italijani
-0.86
nahilalakip
-0.85
disambiguazione
-0.83
Paglinawan
-0.83
Jeografia
-0.82
POSITIVE LOGITS
Juf
1.66
Gorb
1.59
maneu
1.58
encomp
1.57
deleter
1.57
fuf
1.57
strick
1.56
depic
1.55
Timp
1.54
Epif
1.54
Activations Density 0.118%