INDEX
Explanations
names of organizations and entities
Neuron Alignment
Index    Value    % of L₁
1013     +0.09    0.3%
453      +0.07    0.2%
1870     +0.07    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1494     +0.09       0.06
648      +0.07       0.04
1720     +0.07       0.03
Negative Logits
Token         Logit
époux         -0.83
delà          -0.82
créateur      -0.80
broderie      -0.77
vainqueur     -0.74
vôtre         -0.74
ecru          -0.73
lapin         -0.72
malheureux    -0.72
coté          -0.72
Positive Logits
Token       Logit
Herzlich    0.70
<bos>       0.63
Junio       0.62
Tanja       0.61
Elke        0.61
Gobierno    0.61
ideolog     0.60
Siglo       0.60
Dage        0.59
Sú          0.59
Activations Density: 0.429%