INDEX
    Explanations

    mentions of autographs or signing items

    references to autographs

    New Auto-Interp
    Negative Logits
    UTION
    -0.85
    IRO
    -0.83
     Soda
    -0.81
    RAFT
    -0.80
    BE
    -0.79
    mary
    -0.79
    ENCE
    -0.78
    ISION
    -0.78
    FORE
    -0.76
    ptives
    -0.76
    POSITIVE LOGITS
    ographs
    1.26
    ograph
    1.21
    ographed
    1.09
     aut
    1.04
    iques
    0.99
    istically
    0.98
    ocom
    0.93
    archs
    0.91
    umn
    0.91
    ogyn
    0.89
    Act Density 0.008%

    No Known Activations