INDEX
Explanations
legal and ethical terms or concepts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1133
+0.14
0.8%
251
+0.12
0.7%
1581
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.14
0.03
1133
+0.12
0.02
1793
+0.11
0.03
Negative Logits
<bos>
-2.12
ganu
-0.69
Aholisi
-0.60
مرئيه
-0.57
حياتها
-0.57
principalColumn
-0.56
Palmar
-0.56
uska
-0.56
//-----
-0.55
Ок
-0.55
POSITIVE LOGITS
jaya
1.43
sarili
1.27
tanong
1.21
susun
1.19
pagkak
1.19
jati
1.17
kanya
1.16
muna
1.14
jawa
1.14
Ethics
1.14
Activations Density 0.263%