INDEX
    Explanations

    references to online identities or specific usernames

    New Auto-Interp
    Head Attr Weights
    0:0.17
    1:0.14
    2:0.07
    3:0.09
    4:0.03
    5:0.11
    6:0.03
    7:0.03
    8:0.11
    9:0.08
    10:0.05
    11:0.05
    Negative Logits
    bay
    -1.90
    payers
    -1.76
    iverse
    -1.62
    kiss
    -1.61
    bike
    -1.61
    lif
    -1.58
    file
    -1.58
    pour
    -1.58
    775
    -1.57
    clone
    -1.52
    POSITIVE LOGITS
    rocal
    1.90
    ENE
    1.85
     Contra
    1.73
     Miscellaneous
    1.73
    otine
    1.70
     Sap
    1.70
     Span
    1.66
     Wasserman
    1.59
     Introduction
    1.58
    ggle
    1.58
    Act Density 0.002%

    No Known Activations