INDEX
Explanations
financial activities such as giving money and discussing concerns about data falsification
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
184
+0.23
0.7%
964
+0.13
0.4%
453
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
184
+0.23
0.03
16
+0.13
0.05
453
+0.12
0.04
Negative Logits
">//
-0.78
<<<<<<<<<<<<<<
-0.75
())))
-0.72
LinkId
-0.70
Clik
-0.69
millimeters
-0.69
RTEE
-0.69
Wikiseite
-0.68
laborales
-0.67
famí
-0.66
POSITIVE LOGITS
reluct
2.06
snoopy
2.01
shenan
1.98
impra
1.97
depic
1.97
affor
1.89
strick
1.88
fath
1.87
unve
1.80
wherea
1.77
Activations Density 0.258%