INDEX
    Explanations

    questions starting with "How does" and "What does"

    questions that begin with "how does" or related phrases

    New Auto-Interp
    Negative Logits
    fights
    -0.79
    ascript
    -0.76
    devices
    -0.76
    isphere
    -0.74
    boats
    -0.72
    runners
    -0.72
    cies
    -0.70
    tracks
    -0.70
    ishly
    -0.70
    sers
    -0.70
    POSITIVE LOGITS
     anyone
    0.86
     anybody
    0.86
    olation
    0.83
    olated
    0.72
     one
    0.68
    omorphic
    0.67
    olate
    0.67
     it
    0.67
     this
    0.67
    onga
    0.63
    Act Density 0.051%

    No Known Activations