INDEX
    Explanations

    occurrences of specific movie titles

    references to titles of films or literary works that begin with "The."

    New Auto-Interp
    Negative Logits
     patiently
    -0.71
     contributed
    -0.70
    lished
    -0.70
    omever
    -0.65
     endeav
    -0.64
    posted
    -0.64
    elsen
    -0.64
     authored
    -0.63
    perse
    -0.63
     behalf
    -0.63
    POSITIVE LOGITS
    atre
    1.17
    oret
    1.16
    orem
    1.09
    odor
    1.04
     Simpsons
    1.03
    resa
    1.02
    ories
    1.00
     Greatest
    1.00
    sis
    0.98
     Stranger
    0.95
    Act Density 0.120%

    No Known Activations