INDEX
Explanations
directional terms and references to locations
Negative Logits
Token         Logit
utr           -0.18
uetype        -0.17
north         -0.16
northern      -0.16
IBE           -0.15
south         -0.15
ça            -0.15
inson         -0.15
BackPressed   -0.15
güney         -0.14
Positive Logits
Token         Logit
ward          0.43
wards         0.40
bound         0.39
WARD          0.29
Bound         0.26
WARDS         0.25
BOUND         0.25
wards         0.24
bounds        0.22
ward          0.22
Activations Density 0.021%