INDEX
    Explanations

    instances where the word "perhaps" is used

    instances of the word "perhaps"

    New Auto-Interp
    Negative Logits
    lete
    -0.73
    ocene
    -0.73
    arthed
    -0.72
    tains
    -0.72
    vance
    -0.71
    ulative
    -0.70
    elight
    -0.69
    yers
    -0.68
    eworld
    -0.68
    ved
    -0.67
    POSITIVE LOGITS
     unsurprisingly
    0.80
     "$:/
    0.76
     sensing
    0.75
     unsus
    0.73
     amen
    0.73
     allev
    0.72
     irrit
    0.71
     opio
    0.70
     ironically
    0.70
     occas
    0.68
    Act Density 0.016%

    No Known Activations