INDEX
    Explanations
    Head Attribution Weights
    0:0.06
    1:0.04
    2:0.09
    3:0.05
    4:0.07
    5:0.10
    6:0.04
    7:0.05
    8:0.26
    9:0.07
    10:0.07
    11:0.05
    Negative Logits
    lishes          -1.51
    ":["            -1.50
     Runs           -1.47
    ulate           -1.41
     Intelligent    -1.36
     Frames         -1.31
     recogn         -1.29
    RF              -1.29
     Seym           -1.28
     RBI            -1.28
    Positive Logits
    atten           1.57
    Yan             1.47
    gettable        1.47
    antine          1.41
    ozyg            1.39
    packages        1.37
    Ott             1.35
    minist          1.35
    hattan          1.34
    DragonMagazine  1.34
    Activation Density: 0.001%

    No Known Activations