INDEX
Explanations
mentions of bodies of water, particularly oceans
references to oceanic environments
New Auto-Interp
Negative Logits
Reloaded
-0.83
TI
-0.79
POS
-0.75
nder
-0.74
TOP
-0.69
NER
-0.69
DIR
-0.67
mingham
-0.66
puff
-0.65
Filename
-0.65
POSITIVE LOGITS
front
1.09
ographer
1.03
basin
1.03
liner
0.97
Ocean
0.92
ocean
0.91
ographers
0.89
circulation
0.86
ographic
0.86
waters
0.85
Activations Density 0.009%