INDEX
    Explanations

    entities associated with criminal acts

    the word "who" in various contexts

    New Auto-Interp
    Negative Logits
    ³³³³
    -0.80
     Bound
    -0.71
    BACK
    -0.68
     Delicious
    -0.67
     Glob
    -0.65
     Processing
    -0.65
     Around
    -0.65
     Dos
    -0.64
     Lions
    -0.64
     Anything
    -0.63
    POSITIVE LOGITS
    soever
    1.11
    oping
    1.02
     accompanies
    0.99
     preceded
    0.96
    oped
    0.95
     attended
    0.90
     oversaw
    0.89
    ever
    0.89
     attends
    0.89
     resided
    0.88
    Act Density 0.168%

    No Known Activations