INDEX
    Explanations

    terms related to consistency and stability in performance or behavior

    New Auto-Interp
    Head Attr Weights
    0:0.02
    1:0.02
    2:0.08
    3:0.08
    4:0.06
    5:0.04
    6:0.09
    7:0.34
    8:0.03
    9:0.04
    10:0.07
    11:0.09
    Negative Logits
    OULD
    -1.70
    izon
    -1.65
    -1.63
    endez
    -1.58
    andals
    -1.58
    reason
    -1.57
    ヴァ
    -1.57
    -1.53
    fitting
    -1.47
    -1.47
    POSITIVE LOGITS
     steady
    1.86
     rotating
    1.66
     footsteps
    1.65
     chorus
    1.59
     obedient
    1.53
     dwindling
    1.50
     vibration
    1.49
     downward
    1.48
     trickle
    1.47
     drum
    1.47
    Act Density 0.001%

    No Known Activations