INDEX
Explanations
references to lottery wins or gambling
references to lotteries and significant events associated with them
New Auto-Interp
Negative Logits
romising
-0.78
developed
-0.75
contained
-0.74
anti
-0.73
olulu
-0.72
OST
-0.72
angers
-0.66
arching
-0.66
orses
-0.65
ests
-0.65
POSITIVE LOGITS
tery
1.33
Tycoon
0.73
poons
0.72
lled
0.71
WARE
0.70
tsky
0.70
illac
0.70
Royale
0.69
lation
0.68
xit
0.68
Activations Density 0.019%