INDEX
    Explanations

    mentions of television shows and related content

    New Auto-Interp
    Negative Logits
     Wise
    -0.16
    oft
    -0.16
     intervention
    -0.15
     Ingram
    -0.15
    orp
    -0.14
    vo
    -0.14
     Fair
    -0.14
    190
    -0.14
    eten
    -0.14
    ugin
    -0.14
    POSITIVE LOGITS
    ADVERTISEMENT
    0.17
    ateg
    0.16
    جا
    0.16
    EIF
    0.15
    خص
    0.15
    responses
    0.15
    lyph
    0.14
    xea
    0.14
     Abb
    0.14
    ayah
    0.14
    Act Density 0.093%

    No Known Activations