INDEX
Explanations
negative sentiments and expressions of doubt or frustration
Negative Logits
Token         Logit
ninger        -0.19
itemprop      -0.17
opes          -0.16
thumbs        -0.15
annon         -0.14
athlon        -0.14
undi          -0.14
StatusLabel   -0.14
han           -0.13
scop          -0.13
Positive Logits
Token         Logit
sniff         0.19
panic         0.18
quit          0.17
pan           0.16
necessarily   0.16
panicked      0.15
crack         0.15
ustain        0.15
many          0.15
ADOR          0.15
Activations Density: 0.066%