INDEX
Explanations
instances of complaints or expressions of dissatisfaction
Negative Logits
Token     Logit
WEST      -0.16
ales      -0.16
ipsis     -0.16
eming     -0.15
kits      -0.15
llx       -0.15
kles      -0.15
scopes    -0.14
car       -0.14
rok       -0.14
Positive Logits
Token         Logit
about         0.20
ants          0.19
complaints    0.17
Complaint     0.17
complaint     0.17
/problem      0.16
ABOUT         0.16
unct          0.15
unity         0.15
complain      0.15
Activations Density: 0.019%