INDEX
    Explanations

    phrases indicating risk or vulnerability

    New Auto-Interp
    Negative Logits
    uart
    -0.15
    mares
    -0.15
    .singleton
    -0.14
     å¼ķ
    -0.14
    rove
    -0.14
    aires
    -0.14
    .Errors
    -0.14
    acity
    -0.14
    .prevent
    -0.13
    upal
    -0.13
    POSITIVE LOGITS
     increased
    0.34
     heightened
    0.28
     greater
    0.27
     greatest
    0.26
     Increased
    0.25
    greater
    0.22
    Increased
    0.22
     imminent
    0.20
     Greatest
    0.20
    Greater
    0.19
    Act Density 0.017%

    No Known Activations