INDEX
    Explanations

    references to specific episodes

    references to specific episodes of television shows

    New Auto-Interp
    Negative Logits
    ntil
    -0.90
    milo
    -0.80
    hee
    -0.80
    sheets
    -0.76
    wegian
    -0.73
    enei
    -0.71
    pse
    -0.71
    gn
    -0.70
     unchecked
    -0.69
    covered
    -0.69
    POSITIVE LOGITS
     episode
    1.27
     episodes
    1.06
    Episode
    0.92
    episode
    0.91
     Episode
    0.89
     finale
    0.86
    isodes
    0.86
    ĸļ
    0.84
     opener
    0.76
    ODE
    0.75
    Act Density 0.014%

    No Known Activations