INDEX
Explanations
references to locations, specifically Washington and related geographical contexts
New Auto-Interp
Negative Logits
Tennessee
-0.18
Bermuda
-0.17
Illinois
-0.17
Pennsylvania
-0.15
Florida
-0.15
oc
-0.15
Connecticut
-0.14
Texas
-0.14
Alabama
-0.14
077
-0.14
POSITIVE LOGITS
Seattle
0.64
Seattle
0.58
Tacoma
0.57
Spokane
0.50
Pu
0.45
Seahawks
0.45
Vancouver
0.44
WA
0.39
okane
0.38
Yak
0.37
Activations Density 0.240%