INDEX
Explanations
information about observations or sightings
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1691
+0.13
0.4%
1491
+0.12
0.4%
1839
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1691
+0.13
0.05
549
+0.12
0.05
1491
+0.11
0.04
Negative Logits
trouvait
-0.49
Agregar
-0.48
Referanser
-0.48
Eliminar
-0.43
nomme
-0.43
듦
-0.43
Iné
-0.43
Referencer
-0.42
Kole
-0.42
ochial
-0.42
POSITIVE LOGITS
Saw
1.06
saw
1.01
saw
0.99
Saw
0.97
Sees
0.92
saws
0.89
SEEN
0.88
SAW
0.87
sawing
0.85
thut
0.85
Activations Density 0.116%