INDEX
Explanations
phrases indicating an excessive or negative sentiment towards a particular subject
phrases emphasizing excessive quantity or severity
New Auto-Interp
Negative Logits
enance
-0.83
quo
-0.73
emer
-0.66
itatively
-0.65
è¦ļéĨĴ
-0.63
ograph
-0.63
defending
-0.63
Ips
-0.62
à¨
-0.61
farious
-0.61
POSITIVE LOGITS
Too
1.11
BuyableInstoreAndOnline
0.82
Enough
0.81
utonium
0.78
Too
0.77
ifax
0.72
bad
0.71
username
0.69
len
0.67
Klux
0.67
Activations Density 0.006%