INDEX
Explanations
references to economic sanctions or embargoes
Neuron Alignment
Index    Value    % of L₁
50       +0.23    0.9%
1896     +0.12    0.4%
1137     +0.10    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
919      +0.23       0.04
1328     +0.12       0.04
1870     +0.10       0.03
Negative Logits
<bos>           -2.60
individu        -0.65
/***            -0.65
///**           -0.61
herbe           -0.60
reag            -0.60
glPushMatrix    -0.57
displayquote    -0.57
mena            -0.57
solidar         -0.57
Positive Logits
blackish     1.03
bandung      0.95
greyish      0.92
toledo       0.89
Banjar       0.88
Minang       0.88
tolerably    0.82
purplish     0.82
miguel       0.80
élégante     0.80
Activations Density 0.219%