    Explanations

    phrases indicating important events or moments in time

    instances of the word "when."

    Negative Logits
    ertain        -0.76
    whatever      -0.70
    plus          -0.67
    ggles         -0.66
    vre           -0.66
    bear          -0.65
    augh          -0.64
    agin          -0.63
    ilus          -0.63
    Grade         -0.62
    Positive Logits
    soever         1.27
     they          0.88
    upon           0.84
     confronted    0.77
     hackers       0.75
     someone       0.73
     hordes        0.73
     suddenly      0.72
     he            0.70
     gunmen        0.69
    Activation Density: 0.106%

    No Known Activations