Explanations
references to World War II (WWII) and related historical or military terms
mentions of the acronym "WW" and its related contexts
Negative Logits
Downloadha  -0.84
succeeding  -0.80
uated       -0.76
alties      -0.73
staking     -0.71
Reviewer    -0.70
uate        -0.69
ioned       -0.66
yielding    -0.66
etsk        -0.66
Positive Logits
WW          1.26
ombat       0.94
WF          0.85
JD          0.85
icz         0.81
restling    0.81
WW          0.80
AMP         0.80
urst        0.79
Norton      0.79
Activations Density: 0.004%