INDEX
    Explanations

    incidents and descriptions of property destruction

    New Auto-Interp
    Negative Logits
    å¹¹ç·ļ
    -0.17
    λικ
    -0.16
    errer
    -0.16
    Äįan
    -0.15
    arness
    -0.15
    ignal
    -0.14
    浦
    -0.14
    .xtext
    -0.14
    igar
    -0.14
    ereco
    -0.14
    POSITIVE LOGITS
    vale
    0.18
     window
    0.15
     Bender
    0.14
     Wein
    0.14
     Window
    0.14
     Rosenstein
    0.14
    QA
    0.14
     vandalism
    0.14
    åŁº
    0.14
    SF
    0.14
    Act Density 0.178%

    No Known Activations