INDEX
Explanations
terms and phrases related to legal or technical definitions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
270
+0.11
0.3%
16
+0.10
0.3%
1978
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
270
+0.11
0.04
878
+0.10
0.04
1601
+0.08
0.03
Negative Logits
panik
-0.78
plak
-0.73
adal
-0.72
kask
-0.71
torba
-0.70
kaos
-0.70
malin
-0.70
kras
-0.69
migli
-0.67
interv
-0.66
POSITIVE LOGITS
"
0.83
“
0.78
'
0.76
‘
0.74
«
0.70
‘‘
0.68
actionTypes
0.64
„
0.63
Ename
0.63
``
0.63
Activations Density 0.163%