    Explanations

    phrases prompting consideration or reflection

    Negative Logits
    Token         Logit
    "},"          -0.71
    ially         -0.71
    Cause         -0.69
    ][/           -0.67
     Written      -0.66
    ccess         -0.66
    \"            -0.65
    \",           -0.63
    "}],"         -0.63
    Requires      -0.62
    Positive Logits
    Token         Logit
     Exhibit      0.73
    tainment      0.70
     examples     0.70
     Carly        0.68
     Akron        0.68
     Leban        0.68
     example      0.67
     Fukushima    0.67
     Rwanda       0.67
     Corinthians  0.67
    Activation Density: 0.103%

    No Known Activations