INDEX
    Explanations

    common English pronouns

    the repeated mention of the word "Who."

    New Auto-Interp
    Negative Logits
    MER
    -0.80
    interstitial
    -0.75
    PORT
    -0.68
    rations
    -0.67
    Roy
    -0.65
    âĸº
    -0.65
    BACK
    -0.64
     Mast
    -0.62
    ANA
    -0.61
    pit
    -0.60
    POSITIVE LOGITS
    soever
    1.40
    abouts
    0.93
    oping
    0.91
    ever
    0.88
    resy
    0.84
    ileaks
    0.83
    oped
    0.83
     trave
    0.81
    ictionary
    0.81
    tymology
    0.81
    Act Density 0.125%

    No Known Activations