INDEX
Explanations
locations or cities, specifically in California, as indicated by a high activation for the term 'Calif'
references to California
New Auto-Interp
Negative Logits
BOOK
-0.78
buster
-0.74
Crimean
-0.73
favour
-0.72
schild
-0.68
TPPStreamerBot
-0.66
balance
-0.66
denomin
-0.66
cater
-0.64
theless
-0.63
POSITIVE LOGITS
Calif
1.07
.,
0.88
eno
0.88
ano
0.87
ubs
0.83
illo
0.80
Oro
0.78
uti
0.78
uce
0.77
isco
0.76
Activations Density 0.011%