INDEX
Explanations
numerical data or statistics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1577
+0.26
0.8%
50
+0.13
0.4%
1978
+0.13
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
382
+0.26
0.12
1343
+0.13
0.13
523
+0.13
0.08
Negative Logits
TagMode
-0.84
ביוגרפיה
-0.79
الدراسه
-0.75
Fordítás
-0.75
setVerticalGroup
-0.75
Serviço
-0.75
További
-0.73
جوايز
-0.73
Até
-0.70
Estou
-0.70
POSITIVE LOGITS
effe
1.60
?...
1.59
fta
1.52
squa
1.51
suscep
1.50
Intere
1.49
aen
1.48
§.
1.48
desir
1.47
embodi
1.47
Activations Density 1.306%