    Explanations

    mentions of the word "pocket."

    Negative Logits
    Token       Logit
    " inappro"  -0.78
    " disagre"  -0.76
    " squa"     -0.76
    " unden"    -0.74
    " sii"      -0.72
    " wien"     -0.70
    " desir"    -0.70
    " emphat"   -0.70
    " weber"    -0.68
    " edp"      -0.68
    Positive Logits
    Token       Logit
    " pocket"   1.58
    "pocket"    1.45
    " Pocket"   1.42
    "Pocket"    1.42
    " pockets"  1.35
    " Pockets"  1.04
    " wallet"   0.93
    "wallet"    0.90
    " wallets"  0.81
    "ポケット"  0.78
    Activation Density: 0.098%

    No Known Activations