INDEX
Explanations
references to government bodies, specifically the term "Congress"
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1983
+0.13
0.5%
67
+0.13
0.5%
1870
+0.13
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
67
+0.13
0.03
1983
+0.13
0.03
395
+0.13
0.03
Negative Logits
increa
-0.88
encomp
-0.85
intersper
-0.82
shenan
-0.82
guarante
-0.81
affor
-0.80
hairc
-0.78
fuf
-0.78
effe
-0.75
scrat
-0.75
POSITIVE LOGITS
Congress
1.37
Congress
1.24
congress
1.08
congressional
0.97
congress
0.93
Congressional
0.92
CONGRESS
0.90
Congrès
0.77
ressional
0.76
congressman
0.74
Activations Density 0.067%