INDEX
    Explanations

    references to casinos and gambling-related terminology

    Negative Logits
    ...↵            -0.33
    ....↵           -0.26
    ...↵↵           -0.23
                    -0.22
     ...↵           -0.22
     "              -0.21
    â̝               -0.20
    ...             -0.19
     López          -0.19
    ,...↵           -0.18
    Positive Logits
     casino         0.54
     poker          0.52
     blackjack      0.49
     Casino         0.48
     roulette       0.47
     Poker          0.46
     gambling       0.45
     Blackjack      0.43
     slot           0.43
     Gambling       0.42
    Activation Density: 0.270%

    No Known Activations