INDEX
Explanations
the presence of a specific term or keyword related to 'ag' or 'agencies'
Neuron Alignment
Index   Value   % of L₁
443     +0.13   0.7%
87      +0.13   0.7%
283     +0.12   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
283     +0.13      0.04
443     +0.13      0.03
465     +0.12      0.04
Negative Logits
Ĥ¬      -2.12
ĸ´      -2.05
Ĥ       -2.00
´       -1.97
¨       -1.92
ĭ       -1.87
ħ       -1.85
?)      -1.83
?).     -1.81
ı       -1.79
Positive Logits
gregation     1.70
gered         1.66
doll          1.54
RET           1.50
bucks         1.45
rangian       1.45
submissions   1.44
gers          1.44
exhausted     1.42
ostic         1.39
Activations Density: 0.021%