Explanations
words related to government policies and programs, as well as organizations and names associated with political and educational matters
Neuron Alignment
Index    Value    % of L₁
678      +0.13    0.4%
50       +0.13    0.4%
24       +0.09    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
678      +0.13       0.05
24       +0.13       0.05
16       +0.09       0.06
Negative Logits
JAKARTA    -0.53
URBANA     -0.51
ventre     -0.51
manteau    -0.49
Witam      -0.48
tuyau      -0.48
YMMV       -0.47
jakby      -0.47
<?         -0.46
rouleau    -0.46
Positive Logits
pama       0.62
milla      0.58
démoc      0.57
skimage    0.57
beren      0.57
susun      0.54
Nö         0.54
magis      0.54
Expt       0.54
pymysql    0.53
Activations Density: 0.379%