INDEX
    Explanations

    phrases indicating uncertainty or possibility

    expressions of uncertainty or speculation

    New Auto-Interp
    Negative Logits
    ovember
    -0.69
    ----------
    -0.65
    pin
    -0.63
    trap
    -0.62
     translator
    -0.61
    Typ
    -0.60
    La
    -0.60
    Released
    -0.59
    Sac
    -0.59
    idan
    -0.57
    POSITIVE LOGITS
     they
    1.06
     he
    0.89
     THEY
    0.87
    anwhile
    0.86
     she
    0.79
     it
    0.76
    they
    0.75
     we
    0.74
     there
    0.74
     none
    0.72
    Act Density 0.305%

    No Known Activations