INDEX
    Explanations

    adjectives or verbs related to being impressed

    expressions of admiration or impact

    New Auto-Interp
    Negative Logits
     hijacked
    -0.66
    adobe
    -0.66
    nia
    -0.64
    space
    -0.64
     sovere
    -0.62
    angled
    -0.60
     saline
    -0.60
    umble
    -0.59
     starch
    -0.58
    yll
    -0.58
    POSITIVE LOGITS
    mented
    1.09
    ments
    1.06
    ively
    1.02
    ment
    0.93
    mentation
    0.91
    MENT
    0.84
    iveness
    0.84
    ences
    0.83
    enced
    0.83
    encer
    0.81
    Act Density 0.062%

    No Known Activations