INDEX
    Explanations

    language indicating a sense of urgency or seriousness, particularly related to a negative situation

    instances of the word "dire" indicating urgent or severe situations

    New Auto-Interp
    Negative Logits
    obbies
    -0.84
    adesh
    -0.81
    nesota
    -0.79
    ertodd
    -0.77
    adding
    -0.75
    imately
    -0.74
    į
    -0.73
    andise
    -0.72
    ACP
    -0.72
     fixme
    -0.71
    POSITIVE LOGITS
    wolf
    0.96
    gency
    0.91
    wolves
    0.88
    ly
    0.87
    stal
    0.85
     dire
    0.81
    bly
    0.80
    sted
    0.78
    lin
    0.78
    cci
    0.77
    Act Density 0.011%

    No Known Activations