INDEX
    Explanations

    words related to importance or severity

    intensifiers or descriptors indicating significant issues or threats

    New Auto-Interp
    Negative Logits
     Leilan
    -0.71
     Reef
    -0.66
     Kirin
    -0.61
     Ley
    -0.61
     Cath
    -0.60
     Kra
    -0.60
     Shepard
    -0.60
     Aus
    -0.60
     Weston
    -0.59
     Muss
    -0.59
    POSITIVE LOGITS
    theless
    1.13
    terday
    1.03
    usterity
    1.03
    tenance
    0.97
    mosp
    0.91
    etheless
    0.89
    gettable
    0.88
    veyard
    0.87
    selves
    0.86
    withstanding
    0.86
    Act Density 0.220%

    No Known Activations