INDEX
Explanations
instances of the word "awarded" in various contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
376
+0.17
1.0%
156
+0.17
1.0%
148
+0.14
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
302
+0.17
0.02
290
+0.17
0.02
147
+0.14
0.02
Negative Logits
omorphisms
-1.78
Argued
-1.75
akin
-1.75
ck
-1.66
characterized
-1.62
Ń
-1.54
Īĺ
-1.50
uten
-1.50
omorphism
-1.50
ushed
-1.47
POSITIVE LOGITS
giving
1.94
directions
1.69
favour
1.61
ganglia
1.51
wounds
1.37
refuge
1.34
territory
1.31
range
1.29
astr
1.29
seconds
1.28
Activations Density 4.904%