INDEX
Explanations
mentions of the word "East" in various contexts
Negative Logits
ottes        -0.16
uman         -0.15
æĹıèĩªæ²»    -0.15
otti         -0.14
á»ĭ          -0.14
ovel         -0.14
antro        -0.14
odore        -0.14
sted         -0.14
ity          -0.14
Positive Logits
ablish       0.33
ward         0.30
bourne       0.27
side         0.26
Coast        0.26
erner        0.26
coast        0.26
ertime       0.26
ern          0.25
ERN          0.24
Activations Density: 0.026%