INDEX
Explanations
phrases where the speaker expresses personal preferences or experiences
phrases expressing strong personal preferences or fandom
Negative Logits
eworks          -0.80
bars            -0.71
querade         -0.69
omin            -0.69
alternatives    -0.67
rooms           -0.67
iths            -0.66
arrangements    -0.66
elight          -0.66
mates           -0.65
Positive Logits
believer        1.39
sucker          1.28
proponent       1.20
fan             1.20
subscriber      1.04
programmer      1.03
skept           1.02
follower        1.01
lover           1.00
gamer           0.99
Activations Density 0.107%