INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.04
    1:0.02
    2:0.09
    3:0.13
    4:0.12
    5:0.05
    6:0.05
    7:0.06
    8:0.05
    9:0.08
    10:0.19
    11:0.09
    Negative Logits
     booted
    -1.77
    slave
    -1.55
    azo
    -1.50
    milo
    -1.48
    cpu
    -1.47
     gul
    -1.47
     experimented
    -1.46
    nda
    -1.46
    gone
    -1.45
    lication
    -1.44
    POSITIVE LOGITS
    racuse
    1.51
     Contribut
    1.49
    iquette
    1.44
    Blog
    1.43
     Blog
    1.42
    Quality
    1.42
     Clifford
    1.42
     Week
    1.39
     Coverage
    1.39
    Hour
    1.38
    Act Density 0.001%

    No Known Activations