INDEX
Explanations
names involving "Singh."
Neuron Alignment
Index   Value   % of L₁
1942    +0.17   0.7%
597     +0.15   0.6%
1872    +0.14   0.6%
Correlated Neurons
Index   Pearson Corr.   Cosine Sim.
981     +0.17           0.03
1872    +0.15           0.02
227     +0.14           0.03
Negative Logits
krishna     -0.65
sentito     -0.61
scopri      -0.61
aspetta     -0.58
dimenti     -0.54
lasciato    -0.52
dimentic    -0.51
trover      -0.51
raccont     -0.50
cammin      -0.50
Positive Logits
Singh       1.15
Singh       1.09
Sikh        0.86
SINGH       0.75
Sikhs       0.74
Şi          0.70
Sing        0.68
Singapur    0.68
Châ         0.68
Lég         0.68
Activations Density: 0.117%