Explanations
references to the sea or ocean-related terms
Negative Logits
Token        Logit
myſelf       -0.95
themſelves   -0.90
purpoſe      -0.88
Chwiliwch    -0.87
himſelf      -0.83
μφωνα        -0.82
ciled        -0.80
itſelf       -0.80
Primero      -0.78
antelope     -0.78
Positive Logits
Token        Logit
Sea          2.05
sea          2.00
Sea          1.91
SEA          1.79
sea          1.68
SEA          1.49
Seaman       1.01
海           0.99
海           0.98
seag         0.98
Activations Density: 0.028%