INDEX
Explanations
uniquely numerical information
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
468
+0.14
0.5%
1937
+0.12
0.4%
2004
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
468
+0.14
0.05
1937
+0.12
0.06
2004
+0.10
0.05
Negative Logits
susun
-0.90
sarili
-0.86
tanong
-0.83
kafe
-0.79
bawat
-0.78
seksi
-0.72
pagkak
-0.72
Himo
-0.71
betweenstory
-0.71
panahon
-0.69
POSITIVE LOGITS
apprehen
0.96
intersper
0.92
gaily
0.76
strick
0.76
endeavouring
0.74
prolly
0.74
impelled
0.72
definately
0.72
ineffec
0.71
guarante
0.70
Activations Density 0.170%