INDEX
Explanations
terms related to conflict, war-torn areas, and infrastructure projects in specific regions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
198
+0.15
0.5%
1385
+0.15
0.4%
964
+0.13
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
198
+0.15
0.07
143
+0.15
0.05
1385
+0.13
0.05
Negative Logits
disreg
-0.89
impra
-0.87
rendono
-0.75
shenan
-0.74
encomp
-0.70
intersper
-0.66
pooh
-0.66
uninten
-0.64
apprehen
-0.63
increa
-0.63
POSITIVE LOGITS
areas
0.90
regions
0.81
environments
0.77
areas
0.70
countries
0.68
districts
0.68
zones
0.67
neighborhoods
0.66
area
0.66
monaster
0.63
Activations Density 0.373%