Explanations
terms related to the legitimacy and effects of government practices and policies
Neuron Alignment
Index    Value    % of L₁
872      +0.13    0.4%
1253     +0.12    0.4%
1166     +0.11    0.3%
Correlated Neurons
Index    Pearson Corr.    Cos Sim.
1166     +0.13            0.06
2007     +0.12            0.03
295      +0.11            0.01
Negative Logits
apprehen     -1.09
reluct       -1.06
inev         -1.04
accla        -1.03
intersper    -1.03
depic        -0.97
emphat       -0.96
embra        -0.93
encomp       -0.93
sophistic    -0.93
Positive Logits
emplar             0.62
ercice             0.60
gruntled           0.57
ajuns              0.57
disambiguazione    0.57
trecut             0.56
Meksiku            0.56
declarat           0.56
nor                0.54
AssemblyTitle      0.54
Activations Density: 0.612%