INDEX
Explanations
mentions of geographical density or population density in a city or area
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
871
+0.10
0.3%
896
+0.09
0.3%
1491
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
127
+0.10
0.02
871
+0.09
0.02
251
+0.09
0.02
Negative Logits
parent
-0.53
Martin
-0.52
Martin
-0.49
Childs
-0.49
Wade
-0.48
Kirk
-0.48
Parent
-0.48
fund
-0.47
врат
-0.47
="#"><
-0.47
POSITIVE LOGITS
density
2.99
Density
2.92
density
2.83
densities
2.72
Density
2.62
dense
2.43
denser
2.31
DENSITY
2.31
dense
2.14
Dense
2.13
Activations Density 0.173%