INDEX
Explanations
instances of the word "South" in various contexts
New Auto-Interp
Negative Logits
oda
-0.16
yte
-0.16
oy
-0.15
ixa
-0.15
ERA
-0.15
kovi
-0.15
puter
-0.15
enge
-0.15
erie
-0.15
erate
-0.14
POSITIVE LOGITS
wick
0.26
western
0.25
wards
0.25
ward
0.25
side
0.24
Dakota
0.24
aven
0.24
Africa
0.23
Carolina
0.22
bound
0.22
Activations Density 0.022%