INDEX
    Explanations

    violence/ abuse

    New Auto-Interp
    Negative Logits
    ram
    -0.07
     specialists
    -0.07
     Implements
    -0.06
     Ninth
    -0.06
    .download
    -0.06
    .Compare
    -0.06
     Supports
    -0.06
    chemistry
    -0.06
    sd
    -0.06
     EMS
    -0.06
    POSITIVE LOGITS
    0.08
     antique
    0.07
     Hak
    0.07
     testers
    0.07
     histoire
    0.06
    .Il
    0.06
    (prom
    0.06
     nová
    0.06
    .groupBox
    0.06
    //----------------
    0.06
    Act Density 0.145%

    No Known Activations