Explanations
mentions of the United States Navy
mentions of the Navy
Negative Logits
Token          Logit
oned           -0.78
DonaldTrump    -0.73
DRM            -0.69
Gree           -0.66
upon           -0.66
Gandhi         -0.64
olkien         -0.64
IBLE           -0.64
aphael         -0.64
DEAD           -0.64
Positive Logits
Token          Logit
SEAL           1.32
Yard           1.09
Seal           0.98
sailor         0.92
boats          0.90
boat           0.89
odon           0.89
Sea            0.86
Fisheries      0.86
ategory        0.85
Activations Density: 0.025%