INDEX
Explanations
information regarding statistics and data
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
382
+0.18
0.6%
1896
+0.13
0.4%
1343
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
382
+0.18
0.06
609
+0.13
0.04
615
+0.12
0.03
Negative Logits
shenan
-1.81
intersper
-1.69
unspeak
-1.67
reluct
-1.64
disreg
-1.59
impra
-1.56
indestru
-1.55
sophistic
-1.54
impractica
-1.51
maneu
-1.50
POSITIVE LOGITS
alkoh
0.82
kokos
0.81
poliuret
0.80
pól
0.80
adverten
0.78
minimalis
0.74
silikon
0.74
bunda
0.74
trö
0.73
karton
0.73
Activations Density 0.112%