INDEX
    Explanations

    phrases indicating action-packed or thrilling content in media reviews

    New Auto-Interp
    Negative Logits
    ieber
    -0.07
     Circ
    -0.06
    çķ°
    -0.06
    Prefs
    -0.06
    _RET
    -0.06
    essages
    -0.06
    =yes
    -0.06
    dda
    -0.06
    mia
    -0.06
    祥
    -0.06
    POSITIVE LOGITS
     episode
    0.11
     weekly
    0.10
     Episode
    0.09
    Weekly
    0.09
     Weekly
    0.09
    weekly
    0.08
    episode
    0.08
    Episode
    0.08
     week
    0.07
    riday
    0.06
    Act Density 0.054%

    No Known Activations