INDEX
    Explanations

    terms related to severity or intensity, particularly in negative contexts

    New Auto-Interp
    Negative Logits
    ividual
    -0.90
    ovember
    -0.85
    assies
    -0.84
    ĸļ
    -0.81
    akeru
    -0.75
    isphere
    -0.72
    ICLE
    -0.71
    xxxxxxxx
    -0.71
    iliary
    -0.71
    phis
    -0.71
    POSITIVE LOGITS
     punishments
    1.06
     punishment
    1.01
     winters
    1.00
    ness
    0.99
    est
    0.98
    nesses
    0.95
     retribution
    0.95
     harsh
    0.92
     penalties
    0.88
     harshly
    0.87
    Act Density 0.013%

    No Known Activations