INDEX
    Explanations

    phrases indicating assumptions or expectations

    expressions of hypothetical situations or conjectures

    New Auto-Interp
    Negative Logits
     Countdown
    -0.61
    rawdownloadcloneembedreportprint
    -0.58
     Tonight
    -0.57
    Enough
    -0.56
     Leilan
    -0.55
     lobb
    -0.54
     Canberra
    -0.54
     Brill
    -0.54
     Gutenberg
    -0.54
    Ready
    -0.53
    POSITIVE LOGITS
     expect
    1.31
     think
    1.23
     imagine
    1.14
     guess
    1.05
     assume
    1.04
     suppose
    1.04
     presume
    1.04
     wonder
    1.00
     hope
    0.97
     suspect
    0.96
    Act Density 0.077%

    No Known Activations