INDEX
Explanations
phrases indicating a call to action or directive to do something
imperative commands
New Auto-Interp
Negative Logits
advertised
-0.69
eers
-0.68
Cong
-0.65
david
-0.65
loo
-0.64
iege
-0.63
nesses
-0.63
agre
-0.62
tem
-0.61
brill
-0.60
POSITIVE LOGITS
aways
1.23
advantage
0.98
overs
0.97
aback
0.93
heed
0.84
away
0.79
OVER
0.78
YR
0.78
autions
0.77
ume
0.77
Activations Density 0.105%