INDEX
Explanations
references to Christianity and Christian-related terms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1741
+0.12
0.5%
313
+0.12
0.4%
920
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
442
+0.12
0.02
313
+0.12
0.02
1480
+0.11
0.02
Negative Logits
Về
-0.59
Ngoài
-0.57
postIndex
-0.55
utafiti
-0.54
*****/
-0.52
setDo
-0.52
Tác
-0.51
للاسماء
-0.51
JTextArea
-0.50
Màu
-0.50
POSITIVE LOGITS
Christian
1.23
Christian
1.17
christian
1.10
CHRISTIAN
1.09
Juf
0.95
Christians
0.95
Augu
0.94
Hn
0.93
Cristi
0.92
Khart
0.90
Activations Density 0.064%