INDEX
    Explanations

    words related to speculation, analysis, and trends

    language indicating opinions or predictions

    New Auto-Interp
    Negative Logits
    ?",
    -0.60
    )",
    -0.59
    "},
    -0.57
     Femin
    -0.57
     reply
    -0.53
     poems
    -0.53
     Nig
    -0.53
     ,"
    -0.53
     poem
    -0.52
    ",
    -0.52
    POSITIVE LOGITS
     expect
    0.76
    etheless
    0.71
     hoping
    0.66
     anticipate
    0.62
     nonetheless
    0.61
    ivably
    0.60
    fy
    0.59
     caution
    0.58
     anecd
    0.58
     unlikely
    0.58
    Act Density 0.965%

    No Known Activations