INDEX
Explanations
mentions of villages and related terms
New Auto-Interp
Negative Logits
urgy
-0.15
_cls
-0.15
Counties
-0.14
CITY
-0.14
ruz
-0.14
attern
-0.14
astle
-0.14
orage
-0.14
linger
-0.14
yx
-0.14
POSITIVE LOGITS
ois
0.23
square
0.22
-wide
0.20
wide
0.20
hall
0.19
elder
0.19
-scale
0.19
-square
0.18
elders
0.18
-state
0.18
Activations Density 0.021%