INDEX
    Head Attr Weights
    0:0.08
    1:0.07
    2:0.08
    3:0.08
    4:0.07
    5:0.10
    6:0.07
    7:0.07
    8:0.09
    9:0.08
    10:0.08
    11:0.08
    Negative Logits
    " ethics"         -2.49
    " Ples"           -2.43
    "atories"         -2.34
    " Ethics"         -2.31
    " Prevention"     -2.24
    " Ashes"          -2.22
    "ogie"            -2.17
    " Chiefs"         -2.16
    " Kobe"           -2.11
    " meg"            -2.11
    Positive Logits
    "panic"           2.67
    "ework"           2.66
    " grocer"         2.58
    "emonium"         2.57
    "oldemort"        2.53
    "inally"          2.34
    "ivan"            2.27
    " Cue"            2.20
    " Lump"           2.19
    " initialized"    2.18
    Act Density 0.000%

    No Known Activations