INDEX
    Explanations

    phrases related to plot points or storylines

    references to plot elements in narratives

    Negative Logits
    IDA                 -0.75
    angelo              -0.73
    Downloadha          -0.69
     certify            -0.67
    ributed             -0.65
    ategory             -0.64
    ++++++++++++++++    -0.64
    KEY                 -0.64
    iosis               -0.63
    ingu                -0.61
    Positive Logits
     plot               1.22
     plots              1.09
     Plot               1.06
    plot                1.00
     plotting           0.97
    ories               0.88
    Plot                0.88
     synopsis           0.82
    lines               0.79
    ters                0.77
    Activation Density: 0.008%

    No Known Activations