INDEX
Explanations
mentions of excessive consumption or indulgence in activities such as eating, drinking, media-watching, and spending
references to binge behavior, especially related to entertainment and consumption
New Auto-Interp
Negative Logits
Rossi
-0.72
ento
-0.71
ansas
-0.68
nces
-0.68
bureau
-0.67
orea
-0.66
rious
-0.66
netic
-0.65
çĭ
-0.64
Manila
-0.64
POSITIVE LOGITS
vironment
0.93
uth
0.92
ervative
0.87
steen
0.84
terday
0.81
zyme
0.77
ð
0.75
Ò
0.73
imated
0.73
verend
0.71
Activations Density 0.086%