INDEX
Explanations
instances of text related to complaints or complaining
New Auto-Interp
Negative Logits
arnaev
-0.71
assisted
-0.65
elta
-0.62
tein
-0.62
arta
-0.61
ivot
-0.61
ahime
-0.60
uton
-0.60
negie
-0.59
omez
-0.59
POSITIVE LOGITS
bitterly
1.33
loudly
1.22
about
1.00
incess
0.92
aloud
0.92
angrily
0.90
louder
0.88
complaining
0.87
isance
0.86
vehemently
0.86
Activations Density 0.038%