INDEX
Explanations
Biblical references and religious terms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.21
0.8%
1741
+0.10
0.4%
1592
+0.07
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
499
+0.21
0.04
1959
+0.10
0.03
872
+0.07
0.04
Negative Logits
<bos>
-1.70
ⓧ
-1.50
/**
-1.29
-1.27
<?
-1.07
/*
-1.01
<?
-0.99
/***
-0.76
JAKARTA
-0.74
/*++
-0.71
POSITIVE LOGITS
maneu
1.05
increa
1.01
Minang
0.96
vry
0.94
daz
0.92
impra
0.91
lele
0.91
coö
0.90
Præ
0.90
affor
0.87
Activations Density 0.093%