INDEX
    Explanations

    violent actions resulting in physical harm

    New Auto-Interp
    Negative Logits
     Normdatei
    -0.81
    原始内容存档于
    -0.79
     ujednoznacz
    -0.73
     lenker
    -0.72
     autorytatywna
    -0.71
     kasarigan
    -0.68
    ArgsConstructor
    -0.68
    RegressionTest
    -0.68
    setVerticalGroup
    -0.65
    AntiForgeryToken
    -0.65
    POSITIVE LOGITS
     landed
    0.56
     landing
    0.56
     hits
    0.47
     lands
    0.47
    Landing
    0.45
    borderBottom
    0.43
    からは
    0.42
     hitting
    0.42
     crashes
    0.41
     noise
    0.41
    Act Density 2.009%

    No Known Activations