INDEX
Explanations
mentions of specific types of structures, specifically buildings
terms related to buildings and construction
New Auto-Interp
Negative Logits
nir
-0.79
ño
-0.73
sha
-0.71
sen
-0.69
owder
-0.68
nia
-0.67
romeda
-0.65
tle
-0.64
pas
-0.64
Nadu
-0.64
POSITIVE LOGITS
buildings
1.18
raper
1.08
Buildings
1.05
chool
0.96
skysc
0.95
blocks
0.91
erected
0.89
complexes
0.89
constructed
0.84
towers
0.83
Activations Density 0.019%