INDEX
    Explanations

    various forms of the word "marginal."

    Head Attr Weights
    0:0.03
    1:0.03
    2:0.04
    3:0.06
    4:0.05
    5:0.05
    6:0.40
    7:0.05
    8:0.04
    9:0.07
    10:0.07
    11:0.05
    Negative Logits
    erness          -1.37
    Downloadha      -1.26
     captcha        -1.26
    vision          -1.25
     prospect       -1.23
     largeDownload  -1.21
    stown           -1.20
    imaru           -1.16
    undred          -1.15
    ouver           -1.15
    Positive Logits
    ciation         1.35
    assi            1.30
    ————            1.26
     Tsukuyomi      1.23
    osi             1.23
    OTAL            1.22
    ophe            1.20
    ée              1.19
    vich            1.18
    ————————        1.17
    Act Density: 0.004%

    No Known Activations