INDEX
Explanations
repetition or emphasis of the word "more."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
501
+0.12
0.7%
53
+0.12
0.7%
338
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
501
+0.12
0.05
239
+0.12
0.08
284
+0.12
0.07
Negative Logits
¼
-3.70
´
-3.26
¹
-3.21
¾
-3.16
ĻĤ
-3.16
·¸
-3.15
½
-3.04
º
-2.98
ľĵ
-2.97
·
-2.92
POSITIVE LOGITS
than
3.74
Than
3.26
than
2.95
Than
2.51
judicial
1.88
manageable
1.68
forthcoming
1.66
likely
1.59
stringent
1.58
organised
1.57
Activations Density 0.199%