INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.02
    1:0.02
    2:0.05
    3:0.07
    4:0.16
    5:0.03
    6:0.12
    7:0.06
    8:0.06
    9:0.03
    10:0.07
    11:0.25
    Negative Logits
     coli
    -1.86
     Chronic
    -1.62
    "/>
    -1.59
    tan
    -1.56
     delinquent
    -1.53
    }"
    -1.52
    *.
    -1.50
     hereafter
    -1.50
    .>>
    -1.49
     Sabb
    -1.47
    POSITIVE LOGITS
    livious
    2.46
    iator
    1.92
    aughs
    1.80
    leans
    1.80
    reet
    1.75
     teasp
    1.71
    undai
    1.69
    affles
    1.68
    gettable
    1.60
    uador
    1.59
    Act Density 0.001%

    No Known Activations