INDEX
Explanations
mentions of different oceans, particularly the Pacific Ocean, in the text
mentions of specific oceanic locations
New Auto-Interp
Negative Logits
<+
-0.71
ts
-0.69
stiffness
-0.66
xual
-0.66
lik
-0.66
CRIP
-0.64
dos
-0.64
gh
-0.63
==
-0.63
hasht
-0.62
POSITIVE LOGITS
Ocean
3.96
Ocean
3.12
ocean
2.11
Seas
1.68
cean
1.64
oceans
1.55
ceans
1.53
Sea
1.49
Atlantic
1.45
Neptune
1.38
Activations Density 0.012%