INDEX
Explanations
mentions of gambling-related terms
mentions of the term "gamble."
New Auto-Interp
Negative Logits
PRESS
-0.63
WARD
-0.61
4000
-0.60
Jed
-0.60
Fax
-0.60
Dame
-0.60
ENCE
-0.58
3000
-0.58
Patriot
-0.57
7601
-0.57
POSITIVE LOGITS
gam
1.33
blers
1.00
ulative
0.90
ulation
0.90
ulations
0.87
reens
0.84
opal
0.83
estone
0.83
romeda
0.80
raph
0.80
Activations Density 0.003%