INDEX
Explanations
dialogue excerpts with emotional expressions showing uncertainty or importance
Negative Logits
Token                 Logit
quickShipAvailable    -0.81
etheless              -0.70
ioned                 -0.70
unal                  -0.69
cephal                -0.68
actionDate            -0.66
Mand                  -0.64
surprisingly          -0.63
inction               -0.61
Flavoring             -0.61
Positive Logits
Token    Logit
'."      1.29
.'"      1.20
',"      1.19
'"       1.16
!'"      1.15
,'"      1.12
?'"      1.05
.")      1.04
』       1.02
…"       0.98
Activations Density 0.250%