INDEX
    Explanations

    mentions of a specific television show

    references to a specific television show

    New Auto-Interp
    Negative Logits
    ntil
    -0.79
    orget
    -0.68
     steel
    -0.68
     granite
    -0.68
    xon
    -0.65
     stone
    -0.62
    ilateral
    -0.62
    quez
    -0.61
    jri
    -0.61
     clot
    -0.60
    POSITIVE LOGITS
    runner
    1.54
    runners
    1.53
    biz
    1.14
    manship
    1.00
    Netflix
    0.97
     premiered
    0.94
     Hulu
    0.93
    wright
    0.92
     airs
    0.88
    creator
    0.87
    Act Density 0.059%

    No Known Activations