INDEX
    Explanations

    terms associated with fame and popularity

    New Auto-Interp
    Head Attr Weights
    0:0.02
    1:0.00
    2:0.16
    3:0.10
    4:0.26
    5:0.02
    6:0.06
    7:0.15
    8:0.03
    9:0.04
    10:0.05
    11:0.06
    Negative Logits
    eways
    -1.70
    onde
    -1.65
    express
    -1.62
    cerpt
    -1.46
    regate
    -1.45
    ongyang
    -1.44
    oya
    -1.44
    sels
    -1.43
    eworks
    -1.40
    enda
    -1.40
    POSITIVE LOGITS
     kidding
    1.87
     affair
    1.85
     Centauri
    1.64
     owing
    1.58
     underestimate
    1.53
     grou
    1.51
    lier
    1.47
     Scandinavian
    1.44
     misunderstood
    1.40
     attracting
    1.38
    Act Density 0.010%

    No Known Activations