INDEX
    Explanations

    information related to various incidents and news events, potentially focusing on violence and safety concerns

    Negative Logits
     tupperware   -1.00
     ecru         -0.94
     camry        -0.92
     chrysler     -0.90
     embodi       -0.87
     Darum        -0.87
     unlaw        -0.85
     hairc        -0.85
     jetta        -0.85
     cushi        -0.81
    Positive Logits
    These         0.95
     These        0.87
    these         0.85
     these        0.67
    hese          0.61
     Theses       0.60
    IMPORTED      0.59
    Even          0.57
     THESE        0.57
    DoubleQuotes  0.57
    Activation Density: 0.422%

    No Known Activations