INDEX
    Explanations

    actions and behaviors associated with processing and decision-making

    New Auto-Interp
    Head Attr Weights
    0:0.53
    1:0.02
    2:0.04
    3:0.07
    4:0.03
    5:0.05
    6:0.03
    7:0.03
    8:0.05
    9:0.05
    10:0.02
    11:0.02
    Negative Logits
     characteristic
    -1.46
     distinguishing
    -1.45
     progressing
    -1.44
     Rece
    -1.33
     hallmark
    -1.33
    -1.30
     Kats
    -1.30
    -1.29
    oshenko
    -1.29
     Parenthood
    -1.29
    POSITIVE LOGITS
    ify
    3.02
    ulate
    2.85
    itate
    2.75
    ize
    2.64
    perse
    2.54
    igrate
    2.49
    strate
    2.41
    pose
    2.35
    inate
    2.23
    rouse
    2.20
    Act Density 1.329%

    No Known Activations