INDEX
    Explanations

    phrases indicating possibility or likelihood

    speculative statements or expressions of possibility

    New Auto-Interp
    Negative Logits
    ament
    -0.82
    cies
    -0.73
     Lauder
    -0.73
     Constant
    -0.71
     Leah
    -0.67
     Palest
    -0.65
    shall
    -0.60
    oric
    -0.60
    rett
    -0.60
    cedented
    -0.60
    POSITIVE LOGITS
     someday
    1.19
    ily
    1.17
    haps
    1.04
     feas
    1.01
    iest
    1.00
     conce
    0.98
     offend
    0.95
     be
    0.94
     plaus
    0.93
    hap
    0.88
    Act Density 0.053%

    No Known Activations