INDEX
    Explanations

    references to violence or attacks involving firearms

    New Auto-Interp
    Negative Logits
     виправивши
    -0.89
    expandindo
    -0.77
     الحره
    -0.77
     referenties
    -0.74
    abestanden
    -0.74
    TestingModule
    -0.73
    ]")]
    -0.73
     />);
    -0.73
    "]);
    
    -0.72
    SourceChecksum
    -0.72
    POSITIVE LOGITS
     according
    0.76
    according
    0.61
     said
    0.59
     says
    0.57
     volgens
    0.56
    iddhartha
    0.50
    AsyncResult
    0.49
    ಂತ
    0.48
    According
    0.47
     menurut
    0.46
    Act Density 0.068%

    No Known Activations