INDEX
Explanations
personal reactions or emotional responses
references to engagement and audience reactions
New Auto-Interp
Negative Logits
utor
-0.78
tesy
-0.74
aband
-0.72
»Ĵ
-0.70
adiq
-0.68
subcontract
-0.66
onduct
-0.65
Pastebin
-0.64
MSN
-0.64
idable
-0.63
POSITIVE LOGITS
itching
1.34
buzzing
1.31
wondering
1.28
laughing
1.27
excited
1.25
intrigued
1.25
goose
1.24
scratching
1.20
craving
1.15
longing
1.15
Activations Density 0.195%