INDEX
    Explanations

    names of characters or people

    characters and their actions within a narrative context

    Negative Logits
    Repeat            -0.79
    IED               -0.77
    interstitial      -0.77
    Depth             -0.72
    PLIED             -0.72
     NUM              -0.71
    orted             -0.69
    vered             -0.68
    TPPStreamerBot    -0.67
    IFIED             -0.67
    Positive Logits
     discovers        1.70
     learns           1.61
     escapes          1.51
     convin           1.49
     decides          1.48
     confronts        1.48
     realizes         1.43
     wakes            1.38
     tries            1.35
     finds            1.34
    Act Density 0.255%

    No Known Activations