INDEX
Explanations
references to specific harbors
New Auto-Interp
Negative Logits
DonaldTrump
-0.82
uable
-0.79
hesis
-0.78
iferation
-0.72
Downloadha
-0.71
ieve
-0.71
ctive
-0.69
yson
-0.68
iosis
-0.67
otine
-0.66
POSITIVE LOGITS
front
0.90
Cruise
0.84
Ship
0.82
Harbour
0.79
Shore
0.79
Yard
0.78
ships
0.75
Ships
0.73
ward
0.73
Harbor
0.72
Activations Density 0.007%