INDEX
    Head Attr Weights
    0:0.08
    1:0.09
    2:0.07
    3:0.09
    4:0.07
    5:0.08
    6:0.06
    7:0.09
    8:0.07
    9:0.06
    10:0.10
    11:0.08
    Negative Logits
    Introduced:-2.10
    [token not captured]:-2.08
    VERSION:-2.06
    sbm:-2.06
     Poc:-2.01
    abba:-2.00
    version:-1.99
    DERR:-1.89
     =================:-1.88
     ($):-1.85
    Positive Logits
    rehens:2.15
    itely:2.05
     suspense:2.00
    aily:1.98
    conservancy:1.96
    udos:1.95
    opard:1.89
    oldown:1.89
     NPR:1.89
     Dharma:1.89
    Act Density 0.000%

    No Known Activations