INDEX
    Explanations

    references to strip clubs

    occurrences of the word "strip"

    New Auto-Interp
    Negative Logits
     Aval
    -0.73
    CV
    -0.71
    VERTISEMENT
    -0.71
    rious
    -0.71
    VERTIS
    -0.68
    riel
    -0.67
     AQ
    -0.67
    cause
    -0.65
     SOS
    -0.64
    pheus
    -0.64
    POSITIVE LOGITS
     strip
    1.38
    strip
    1.15
     strips
    1.13
     malls
    1.12
     stripping
    1.03
    isode
    0.93
     Strip
    0.86
     stripes
    0.84
    cloth
    0.80
     clubs
    0.80
    Act Density 0.006%

    No Known Activations