INDEX
Explanations
terms related to bodies of water and political rhetoric
New Auto-Interp
Negative Logits
Interstitial
-0.93
UGC
-0.89
haar
-0.82
ioch
-0.82
Bron
-0.79
ible
-0.79
ijah
-0.75
STEP
-0.72
chen
-0.72
ivable
-0.72
POSITIVE LOGITS
shore
1.11
Islands
1.03
Seas
0.97
sailing
0.97
sailed
0.96
boat
0.92
Ocean
0.91
voyage
0.91
Kraken
0.90
fisherman
0.90
Activations Density 2.221%