INDEX
    Explanations

    references to gangs and gang-related terminology

    New Auto-Interp
    Negative Logits
    "}")
    -0.54
    "}},
    -0.48
    "](
    -0.47
    存于互联网档案馆
    -0.47
    }")]
    -0.46
    ')))
    -0.44
    addGap
    -0.44
    "])
    
    -0.43
    ')")
    -0.43
    }))
    -0.43
    POSITIVE LOGITS
     Gang
    1.14
     gang
    1.08
    Gang
    1.06
     gangs
    1.02
     feroit
    0.91
     ainfi
    0.89
    gang
    0.87
     gangsters
    0.76
     gangster
    0.75
     pouvoit
    0.73
    Act Density 0.005%

    No Known Activations