INDEX
    Explanations

    questions or inquiry-related phrases

    rhetorical questions or phrases emphasizing inquiry

    Negative Logits
    Token        Logit
    "shore"      -0.66
    "roads"      -0.61
    "trop"       -0.60
    "eer"        -0.59
    "Gy"         -0.59
    "idon"       -0.59
    "println"    -0.59
    "ulic"       -0.58
    "ped"        -0.58
    "gal"        -0.57
    Positive Logits
    Token               Logit
    "soever"            1.27
    " happens"          1.12
    " happened"         1.04
    " transpired"       0.97
    " distinguishes"    0.96
    " happ"             0.90
    " else"             0.85
    " ensued"           0.84
    " constitutes"      0.83
    " separates"        0.83
    Activation Density: 0.086%

    No Known Activations