INDEX
Explanations
requests or invitations for assistance or action
conditional phrases or requests
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.72
Kills
-0.64
Delicious
-0.62
Marse
-0.59
Romeo
-0.59
Hits
-0.58
Inferno
-0.57
Devils
-0.56
Rankings
-0.56
Kaf
-0.56
POSITIVE LOGITS
prefer
0.92
igslist
0.92
dare
0.90
iage
0.89
indulge
0.86
pless
0.83
kindly
0.82
willingly
0.80
want
0.79
ogle
0.75
Activations Density 0.111%