INDEX
    Explanations

    adjectives related to rigidity or inflexibility

    terms related to stiffness and rigidity

    New Auto-Interp
    Negative Logits
    ulhu
    -0.81
     Occupations
    -0.75
    orate
    -0.71
    obyl
    -0.70
    hex
    -0.70
     Hiroshima
    -0.67
     Ancients
    -0.66
     Partnership
    -0.65
     Shutterstock
    -0.63
    uyomi
    -0.63
    POSITIVE LOGITS
    ening
    0.95
     stiff
    0.95
    ened
    0.89
    est
    0.88
    eners
    0.87
    nesses
    0.84
    er
    0.84
    weights
    0.76
    ety
    0.75
     undermin
    0.75
    Act Density 0.010%

    No Known Activations