INDEX
Explanations
crypto-related terms and references
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1194
+0.11
0.4%
75
+0.11
0.4%
1425
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1597
+0.11
0.05
227
+0.11
0.10
1425
+0.10
0.07
Negative Logits
territo
-0.66
saad
-0.64
parency
-0.61
naer
-0.61
maraming
-0.61
najbol
-0.59
loob
-0.59
pagkak
-0.58
maksi
-0.58
taas
-0.57
POSITIVE LOGITS
P
0.62
P
0.57
getP
0.53
PS
0.52
p
0.51
p
0.50
getP
0.49
PI
0.49
PF
0.48
setP
0.48
Activations Density 0.912%