INDEX
Explanations
phrases related to geographical directions
geographical directions or regions
Negative Logits
Coffin         -0.67
decor          -0.61
neural         -0.60
complete       -0.60
induct         -0.59
Honor          -0.59
consumption    -0.59
actual         -0.58
literal        -0.58
Pun            -0.57
Positive Logits
west           4.30
east           3.14
western        2.15
south          1.89
north          1.75
West           1.65
East           1.59
outheast       1.53
heast          1.51
central        1.38
Activations Density: 0.021%