    Explanations

    words related to forceful actions or impacts

    words related to dramatic or impactful actions

    Negative Logits
    Token         Logit
    " PF"         -0.78
    " DOC"        -0.70
    " ob"         -0.67
    " broch"      -0.63
    " science"    -0.63
    " Ik"         -0.63
    " Norway"     -0.62
    " Frey"       -0.62
    " ther"       -0.61
    " amen"       -0.61
    Positive Logits
    Token         Logit
    "ashing"      4.17
    "ashed"       2.79
    "ashes"       2.70
    "ASH"         2.03
    "ash"         1.79
    "asher"       1.35
    "attering"    1.30
    "atching"     1.25
    "usting"      1.22
    "acking"      1.11
    Activation Density: 0.005%

    No Known Activations