INDEX
Explanations
references to historical figures and events associated with the American West
New Auto-Interp
Negative Logits
InputElement
-0.15
ÙĦÙĪØ¨
-0.14
erate
-0.14
ÑĮко
-0.13
seaside
-0.13
umes
-0.13
anko
-0.13
wal
-0.13
hrad
-0.13
ÑĢÑĥз
-0.13
POSITIVE LOGITS
frontier
0.40
Frontier
0.35
front
0.33
west
0.33
Wild
0.31
pioneer
0.30
front
0.30
Front
0.28
Manifest
0.28
pione
0.27
Activations Density 0.087%