INDEX
Explanations
strong emotional statements and accusations involving individuals or groups
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1984
+0.10
0.3%
1978
+0.09
0.3%
1437
+0.08
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
658
+0.10
0.08
1470
+0.09
0.04
845
+0.08
0.04
Negative Logits
Kategor
-0.56
fjspx
-0.56
kosme
-0.56
minimalis
-0.56
İstinadlar
-0.53
bunda
-0.52
protokol
-0.51
NewRow
-0.50
basicConfig
-0.49
DbType
-0.49
POSITIVE LOGITS
yourselves
1.13
yourself
1.07
yourself
0.97
Yourself
0.94
embodi
0.80
YOURSELF
0.80
youre
0.80
your
0.77
Yourself
0.76
resear
0.75
Activations Density 0.653%