INDEX
    Explanations

    sports-related terms and entities

    New Auto-Interp
    Negative Logits
     Gates
    -0.69
    idges
    -0.65
     Flowers
    -0.60
     miscar
    -0.59
     Sunshine
    -0.59
     Strait
    -0.59
     faulty
    -0.58
     Reincarnated
    -0.58
    ipop
    -0.58
     Bride
    -0.58
    POSITIVE LOGITS
    manship
    1.23
    nell
    1.00
    men
    0.92
    bike
    0.90
    fan
    0.88
     Illustrated
    0.88
    scar
    0.83
    sw
    0.82
    friends
    0.82
    mens
    0.78
    Act Density 0.516%

    No Known Activations