INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.07
    1:0.09
    2:0.07
    3:0.07
    4:0.09
    5:0.08
    6:0.08
    7:0.07
    8:0.08
    9:0.08
    10:0.09
    11:0.08
    Negative Logits
    tell
    -2.64
     Gamma
    -2.54
     Anon
    -2.52
     intruder
    -2.49
     Dread
    -2.43
     Pom
    -2.37
     Blossom
    -2.35
     Jericho
    -2.35
     Tomato
    -2.34
     caller
    -2.33
    POSITIVE LOGITS
     Trafford
    3.23
    3.06
    displayText
    2.94
    elong
    2.89
    ˜
    2.84
    uploads
    2.75
    HCR
    2.71
    ilde
    2.67
    rift
    2.56
    ön
    2.56
    Act Density 0.000%

    No Known Activations