INDEX
Explanations
proper nouns related to intelligence agencies and investigations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
663
+0.20
0.9%
479
+0.20
0.9%
757
+0.14
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
663
+0.20
0.03
479
+0.20
0.03
1272
+0.14
0.02
Negative Logits
كومونز
-0.70
unlaw
-0.59
superintend
-0.59
liberality
-0.58
gratify
-0.58
friable
-0.57
tolerably
-0.57
mortgagee
-0.56
ougars
-0.56
unspeak
-0.54
POSITIVE LOGITS
Cla
1.66
Cla
1.58
cla
1.47
cla
1.36
CLA
1.22
CLA
1.01
Clare
0.93
kla
0.87
Clare
0.87
claws
0.85
Activations Density 0.115%