INDEX
    Explanations

    phrases indicating a surprising or revealing discovery

    instances of the phrase "it turns out."

    New Auto-Interp
    Negative Logits
    è¦ļéĨĴ
    -1.05
    accompan
    -0.80
    rongh
    -0.70
    riot
    -0.68
     Expansion
    -0.67
     Repeat
    -0.64
    aign
    -0.59
    76561
    -0.58
     Reserved
    -0.57
    riots
    -0.57
    POSITIVE LOGITS
     out
    1.08
    entious
    0.77
    orned
    0.66
     inward
    0.65
    out
    0.65
    outs
    0.63
    hift
    0.62
    enum
    0.60
     forth
    0.60
     doubtful
    0.60
    Act Density 0.017%

    No Known Activations