INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.08
    3:0.08
    4:0.08
    5:0.08
    6:0.08
    7:0.08
    8:0.08
    9:0.09
    10:0.07
    11:0.08
    Negative Logits
    Block
    -3.08
     Burnett
    -2.98
    Gate
    -2.96
    Kings
    -2.91
    blocks
    -2.85
    allah
    -2.84
    renheit
    -2.82
    uebl
    -2.75
    opol
    -2.75
    hoe
    -2.74
    POSITIVE LOGITS
     NG
    2.90
     zo
    2.79
     implant
    2.79
     Shinzo
    2.68
     strap
    2.55
     wearable
    2.52
     mand
    2.48
     SVG
    2.46
     enlightenment
    2.46
     Okin
    2.44
    Act Density 0.000%

    No Known Activations