INDEX
Explanations
mentions of marine or sea-related terms
references to the sea or ocean
Negative Logits
denomin        -0.71
favor          -0.70
CU             -0.65
ellipt         -0.64
prof           -0.63
Fet            -0.60
transgress     -0.60
programming    -0.59
fest           -0.58
mark           -0.58
Positive Logits
sea             4.80
Sea             2.06
SEA             1.53
sea             1.50
Sea             1.41
marine          1.27
Seas            1.20
seaf            1.19
Ocean           1.19
worm            1.18
Activations Density: 0.009%