    Explanations

    specific locations and events mentioned in a text

    references to specific locations and events related to crime or violence

    Negative Logits
    Token                   Logit
    "cipled"                -0.73
    "laus"                  -0.65
    "cellaneous"            -0.59
    "detail"                -0.55
    "³³³³³³³³³³³³³³³³"      -0.54
    "enture"                -0.54
    "gov"                   -0.53
    "ertodd"                -0.53
    "areth"                 -0.53
    "yne"                   -0.51
    Positive Logits
    Token                   Logit
    " badge"                0.76
    " moniker"              0.67
    " persona"              0.64
    "onto"                  0.63
    " slate"                0.63
    " salute"               0.61
    " barrier"              0.61
    " flag"                 0.59
    " masterpiece"          0.59
    " cra"                  0.59
    Activation Density: 2.774%

    No Known Activations