INDEX
    Explanations

    references to blood and violent imagery

    New Auto-Interp
    Negative Logits
     StatelessWidget
    -0.69
    例句
    -0.64
    fjspx
    -0.59
    WarningLevel
    -0.56
     estimés
    -0.55
     StatefulWidget
    -0.54
    CreateModel
    -0.53
     whim
    -0.52
    umel
    -0.52
     متعلقه
    -0.51
    POSITIVE LOGITS
     blood
    1.52
    blood
    1.34
     bleeding
    1.29
     Blood
    1.28
    Blood
    1.28
     BLOOD
    1.22
     bleed
    1.21
     bleeds
    1.13
    BLOOD
    1.10
     bloed
    1.10
    Act Density 0.277%

    No Known Activations