INDEX
    Explanations

    instances of the word "surely" with high activation values

    the word "surely" and its variations in various contexts suggesting certainty or emphasis

    New Auto-Interp
    Negative Logits
    ocene
    -0.87
    arthed
    -0.78
    insula
    -0.78
    psey
    -0.77
    iatus
    -0.75
    entary
    -0.71
    NING
    -0.70
    anwhile
    -0.69
    apsed
    -0.66
    rition
    -0.66
    POSITIVE LOGITS
     someday
    0.83
    è¦
    0.72
     ought
    0.67
     deserved
    0.66
    footed
    0.66
     deserve
    0.65
     nud
    0.64
     await
    0.63
     surpass
    0.62
     deserves
    0.62
    Act Density 0.029%

    No Known Activations