INDEX
    Explanations

    phrases starting with "To put" followed by an explanation or statement

    phrases related to summarizing or rephrasing information

    New Auto-Interp
    Negative Logits
    externalActionCode
    -0.73
    iller
    -0.67
    cius
    -0.66
    vill
    -0.63
    è¦ļéĨĴ
    -0.63
    COL
    -0.63
    KEN
    -0.62
    KY
    -0.62
     violated
    -0.60
    Developer
    -0.59
    POSITIVE LOGITS
     aside
    1.01
     together
    0.99
     bluntly
    0.93
    ogether
    0.86
     succinct
    0.85
     it
    0.84
    atively
    0.80
     things
    0.79
     plainly
    0.76
    hetically
    0.75
    Act Density 0.055%

    No Known Activations