INDEX
Explanations
References to geographical directions, particularly "north."
Negative Logits
aphael     -0.16
erate      -0.15
pany       -0.15
edla       -0.15
ãĥ¼ãĥIJ     -0.15
ź          -0.15
ÑĪими      -0.14
etur       -0.14
peria      -0.14
AWN        -0.14
Positive Logits
western    0.35
ward       0.30
-east      0.28
-west      0.26
bound      0.26
west       0.25
umberland  0.23
ampton     0.23
rup        0.22
wards      0.22
Activations Density 0.048%