INDEX
Explanations
specific historical references and terms related to specific locations, events, or people
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.31
1.1%
1919
+0.12
0.4%
1177
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1919
+0.31
0.13
862
+0.12
0.11
1097
+0.11
0.10
Negative Logits
intersper
-1.60
trouva
-1.13
universale
-1.10
exé
-1.08
écl
-1.04
soigne
-1.04
espé
-1.04
matel
-1.02
tén
-1.00
gouver
-1.00
POSITIVE LOGITS
Glej
0.92
arrol
0.81
polski
0.81
Și
0.79
sizePolicy
0.78
Paglinawan
0.78
توضیحات
0.77
المصادر
0.76
Quiénes
0.76
Referencoj
0.75
Activations Density 0.793%