INDEX
Explanations
terms related to politics, society, and the shaping of national character
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
764
+0.16
0.5%
1870
+0.16
0.5%
2034
+0.15
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
764
+0.16
0.08
1438
+0.16
0.07
24
+0.15
0.08
Negative Logits
impractica
-1.29
aen
-1.27
guarante
-1.26
affor
-1.25
sappi
-1.25
increa
-1.22
inev
-1.21
encomp
-1.21
ftu
-1.19
reluct
-1.19
POSITIVE LOGITS
.
0.72
Audiodateien
0.65
;
0.61
YNAMICS
0.61
Jahrhunderts
0.59
onomian
0.59
徊
0.58
***!
0.58
:
0.57
mybatisplus
0.57
Activations Density 0.740%