    Explanations

    instances of the word "Perhaps" followed by a sentence

    Negative Logits
    lete      -0.86
    iya       -0.86
    ament     -0.83
    cies      -0.78
    ombat     -0.75
    ieve      -0.74
    ieves     -0.73
    ocaust    -0.73
    atches    -0.73
    cium      -0.72
    Positive Logits
     someday          1.18
     misunder         0.90
     unsurprisingly   0.88
     subconscious     0.86
     sensing          0.85
     underest         0.81
     underestimate    0.78
     overest          0.77
     somew            0.75
     exagger          0.74
    Activation Density: 1.139%

    No Known Activations