INDEX
Explanations
mentions of "Pacific" or related geographical terms
New Auto-Interp
Negative Logits
owell
-0.15
pais
-0.15
Threshold
-0.15
ì½
-0.15
ilon
-0.14
elier
-0.14
nej
-0.14
Monte
-0.14
terraform
-0.14
Ã¥r
-0.14
POSITIVE LOGITS
Rim
0.29
ally
0.24
o
0.21
ific
0.20
rim
0.20
rim
0.20
IFIC
0.19
Ocean
0.19
Coast
0.18
rones
0.18
Activations Density 0.010%