INDEX
Explanations
legal actions or formal procedures
Neuron Alignment
Index    Value    % of L₁
1499     +0.13    0.4%
184      +0.11    0.3%
1870     +0.11    0.3%
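The "% of L₁" column is not defined on this page. One plausible reading, assumed here, is that it reports each neuron's absolute weight as a share of the L₁ norm (the sum of absolute values) of the whole direction vector. The sketch below illustrates that reading on a toy vector; the function name `l1_share` and the toy values are hypothetical and are not taken from the dashboard.

```python
def l1_share(weights):
    """Return each entry's share of the vector's L1 norm, as a percentage.

    Assumption: "% of L1" means 100 * |w_i| / sum_j |w_j|.
    """
    l1 = sum(abs(w) for w in weights)
    if l1 == 0:
        raise ValueError("L1 norm is zero; shares are undefined")
    return [100.0 * abs(w) / l1 for w in weights]


# Toy direction vector (illustrative only, not data from this page).
toy = [0.5, -0.25, 0.25]
shares = l1_share(toy)
print([round(s, 1) for s in shares])
```

Under this reading, a value of +0.13 at 0.4% would imply an L₁ norm of roughly 0.13 / 0.004 ≈ 32.5 for the full vector, though the page itself does not confirm the definition.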
Correlated Neurons
Index    P. Corr.    Cos Sim.
1499     +0.13       0.07
2011     +0.11       0.04
1965     +0.11       0.05
Negative Logits
ananas     -1.28
poire      -1.27
frambo     -1.15
trico      -1.12
marte      -1.10
persil     -1.08
tén        -1.07
fluo       -1.05
aquare     -1.05
strass     -1.04
Positive Logits
was         0.90
includes    0.88
came        0.87
went        0.85
consists    0.82
began       0.82
lasted      0.81
appeared    0.79
took        0.79
happened    0.79
Activations Density: 0.303%