INDEX
Explanations
proper nouns related to locations, specifically locations in California
references to the state of California
New Auto-Interp
Negative Logits
BOOK
-0.80
buster
-0.77
Crimean
-0.73
schild
-0.68
balance
-0.68
minded
-0.67
favour
-0.67
denomin
-0.65
busters
-0.63
SHIP
-0.63
POSITIVE LOGITS
Calif
1.08
.,
0.90
Oro
0.82
isco
0.80
ano
0.80
uti
0.79
eno
0.79
onte
0.78
illo
0.77
olla
0.76
Activations Density 0.008%