INDEX
Explanations
words and phrases related to emotional reactions and communications
New Auto-Interp
Negative Logits
Panic
-0.16
panic
-0.16
panic
-0.15
åĭ¤
-0.15
.questions
-0.15
iment
-0.15
Asking
-0.14
ilia
-0.14
ember
-0.14
panicked
-0.14
POSITIVE LOGITS
hurt
0.18
Dipl
0.17
venes
0.15
receipts
0.15
ÚĺÛĮ
0.15
confrontation
0.15
angered
0.15
apyrus
0.15
okino
0.14
toxic
0.14
Activations Density 0.171%