INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Hairst
    -0.06
     "";
    ↵
    -0.06
     connectivity
    -0.06
     Berk
    -0.06
    ecies
    -0.06
    unifu
    -0.06
    ogram
    -0.06
    cooked
    -0.06
     payments
    -0.06
     Careers
    -0.06
    POSITIVE LOGITS
     sponsor
    0.22
     sponsors
    0.22
    ponsors
    0.17
     Sponsor
    0.14
    ponsor
    0.13
    sponsor
    0.11
     sponsoring
    0.09
    Conference
    0.07
     pimp
    0.07
     spons
    0.07
    Act Density 0.003%

    No Known Activations