INDEX
    Explanations

    words indicating possibility or likelihood

    Negative Logits
        Token              Logit
        "——"               -0.34
        "-----"            -0.33
        "The"              -0.33
        " éc"              -0.32
        "Dés"              -0.32
        (missing)          -0.31
        "In"               -0.31
        " dérou"           -0.31
        " handleSubmit"    -0.31
        "いい"             -0.30
    Positive Logits
        Token              Logit
        " may"             1.27
        " might"           1.05
        " MAY"             1.04
        " May"             1.01
        "may"              0.98
        "May"              0.96
        "อาจ"              0.94
        " Might"           0.93
        "Might"            0.90
        " MIGHT"           0.89
    Activation Density: 0.079%

    No Known Activations