INDEX
    Explanations

    phrases that indicate starting a narrative or discussion

    the phrase "start with" in various contexts

    New Auto-Interp
    Negative Logits
    span
    -0.73
    hops
    -0.71
    agues
    -0.69
     span
    -0.67
    supported
    -0.65
    late
    -0.64
    affected
    -0.62
    stre
    -0.61
    wik
    -0.61
    paced
    -0.60
    POSITIVE LOGITS
     respect
    0.78
     obvious
    0.73
    draw
    0.72
     regard
    0.70
    amaz
    0.68
     regards
    0.67
     basics
    0.65
     introdu
    0.64
     apologies
    0.64
     Rowe
    0.63
    Act Density 0.040%

    No Known Activations