INDEX
Explanations
references to casinos and gambling-related terminology
New Auto-Interp
Negative Logits
...↵
-0.33
....↵
-0.26
...↵↵
-0.23
↵
-0.22
...↵
-0.22
"
-0.21
â̝
-0.20
...
-0.19
López
-0.19
,...↵
-0.18
POSITIVE LOGITS
casino
0.54
poker
0.52
blackjack
0.49
Casino
0.48
roulette
0.47
Poker
0.46
gambling
0.45
Blackjack
0.43
slot
0.43
Gambling
0.42
Activations Density 0.270%