INDEX
    Explanations

    references to various forms of violence or weaponry

    New Auto-Interp
    Negative Logits
     فريبيس
    -0.56
     épaules
    -0.53
     lèvres
    -0.52
    uxxxx
    -0.49
     паспорт
    -0.49
     Skirt
    -0.49
    GEBURTSDATUM
    -0.49
    gown
    -0.48
     pacemaker
    -0.48
     sweatshirt
    -0.48
    POSITIVE LOGITS
     umbrellas
    0.81
     Bibles
    0.79
     pens
    0.75
     knives
    0.75
     swords
    0.74
     backpacks
    0.73
     guitars
    0.73
     shovels
    0.72
     bottles
    0.72
     chairs
    0.71
    Act Density 0.741%

    No Known Activations