INDEX
    Explanations

    words related to sponsorship and support

    New Auto-Interp
    Negative Logits
     Gell
    -0.62
    ctile
    -0.62
    üdis
    -0.60
    Joh
    -0.58
    üll
    -0.58
     bege
    -0.57
    DRIVER
    -0.57
     dresser
    -0.57
    v
    -0.55
     nido
    -0.54
    POSITIVE LOGITS
     sponsors
    1.73
     Sponsors
    1.67
     sponsor
    1.65
     Sponsor
    1.63
    onsors
    1.53
     sponsored
    1.52
    Sponsors
    1.52
    Sponsor
    1.50
     sponsorship
    1.47
     Sponsored
    1.44
    Act Density 0.004%

    No Known Activations