INDEX
Explanations
keywords related to politics, advocacy, and government actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
394
+0.17
0.5%
50
+0.14
0.4%
783
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
394
+0.17
0.07
1806
+0.14
0.07
783
+0.12
0.06
Negative Logits
Halen
-0.76
<bos>
-0.76
Билгалдахарш
-0.74
كومونز
-0.74
uska
-0.73
NOSIS
-0.72
***!
-0.71
IDTH
-0.70
XMLSchema
-0.70
cèse
-0.70
POSITIVE LOGITS
intersper
2.50
increa
2.30
guarante
2.22
impra
2.17
inev
2.17
encomp
2.17
reluct
2.17
fta
2.15
fuf
2.14
purcha
2.12
Activations Density 0.445%