INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.02
    1:0.02
    2:0.11
    3:0.07
    4:0.14
    5:0.03
    6:0.04
    7:0.11
    8:0.05
    9:0.03
    10:0.12
    11:0.20
    Negative Logits
     Chronic
    -1.34
     Species
    -1.32
    tan
    -1.31
     Yam
    -1.24
     subordinate
    -1.19
     @@
    -1.18
     Kod
    -1.18
    helial
    -1.17
     Clan
    -1.16
     Chin
    -1.15
    POSITIVE LOGITS
    asio
    1.44
    cffff
    1.40
    bnb
    1.40
    apo
    1.39
    Cand
    1.36
     spoof
    1.36
    reader
    1.34
    ournal
    1.32
    livious
    1.31
    Override
    1.29
    Act Density 0.023%

    No Known Activations