INDEX
    Explanations

    phrases related to unexpected revelations or surprising discoveries

    Negative Logits
    Token            Logit
    "ropolitan"      -0.65
    " Colleg"        -0.61
    "riot"           -0.61
    "atana"          -0.60
    "lain"           -0.60
    "riots"          -0.59
    "negie"          -0.58
    "icipated"       -0.58
    "è¦ļéĨĴ"         -0.57
    "nea"            -0.56

    Positive Logits
    Token            Logit
    "ĸ"              0.64
    "Ī"              0.64
    " beet"          0.64
    "terday"         0.63
    "WT"             0.62
    "pires"          0.59
    " sour"          0.59
    "Lua"            0.59
    "PsyNetMessage"  0.58
    "laus"           0.57
    Activation Density: 4.185%

    No Known Activations