INDEX
    Explanations

    mentions of "Star Trek"

    references to the "Star" franchises, specifically "Star Wars" and "Star Trek"

    NEGATIVE LOGITS
    "Downloadha"    -0.93
    "terday"        -0.92
    "odcast"        -0.81
    " confir"       -0.76
    "Ń·"            -0.75
    "icultural"     -0.74
    "sembly"        -0.74
    " condem"       -0.73
    "xual"          -0.73
    "ongyang"       -0.73
    POSITIVE LOGITS
    "star"          1.04
    "light"         0.94
    "Star"          0.93
    "vation"        0.90
    " Trek"         0.89
    "liner"         0.88
    "ring"          0.85
    "buck"          0.85
    " Star"         0.84
    "fare"          0.83
    Activation Density: 0.016%

    No Known Activations