INDEX
    Explanations

    phrases related to potential harm or danger

    phrases indicating the concept of risk and danger to lives or communities

    New Auto-Interp
    Negative Logits
     Britt
    -0.69
    tein
    -0.68
    lins
    -0.68
    anche
    -0.66
    ION
    -0.66
    Nap
    -0.65
    encer
    -0.64
    frey
    -0.63
    NC
    -0.62
     Kinnikuman
    -0.62
    POSITIVE LOGITS
    taker
    0.77
    taking
    0.76
    »Ĵ
    0.74
    groups
    0.70
     situations
    0.69
    ħĭ
    0.68
    xual
    0.67
    ailability
    0.66
     stewards
    0.65
    orses
    0.65
    Act Density 0.012%

    No Known Activations