INDEX
Explanations
negative sentiment or criticism
New Auto-Interp
Negative Logits
stoked
-0.63
Sultan
-0.61
lax
-0.60
convol
-0.60
ranks
-0.59
lement
-0.57
ISTER
-0.56
wedd
-0.56
Archdemon
-0.55
stagn
-0.55
POSITIVE LOGITS
purpose
0.80
sight
0.78
task
0.76
oeuv
0.76
each
0.76
package
0.76
die
0.75
sent
0.74
street
0.74
one
0.74
Activations Density 0.015%