INDEX
Explanations
mentions of New York City and its surroundings
Neuron Alignment
Index   Value   % of L₁
202     +0.15   0.9%
268     +0.12   0.6%
87      +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
202     +0.15      0.03
268     +0.12      0.03
417     +0.11      0.03
Negative Logits
batim    -1.82
à±į      -1.64
ETHOD    -1.61
à°¿      -1.59
àµį      -1.57
á̏       -1.57
inco     -1.51
à¯ģ      -1.44
à¯į      -1.43
à±ģ      -1.41
Positive Logits
esses    1.75
ĩ        1.51
itness   1.50
eness    1.40
ialog    1.40
¤        1.34
inates   1.33
ħ        1.33
command  1.32
wings    1.31
Activations Density: 0.131%