INDEX
    Explanations

    phrases indicating something that has not been done or completed yet

    phrases indicating something that has not occurred or been completed yet

    New Auto-Interp
    Negative Logits
    ufact
    -0.82
    gang
    -0.69
     similarities
    -0.65
    ãĥ¼ãĥ
    -0.65
    chio
    -0.62
    ãĥ³ãĤ¸
    -0.60
     disproportion
    -0.60
    ging
    -0.60
    packs
    -0.59
    gers
    -0.59
    POSITIVE LOGITS
    terday
    0.78
    ?:
    0.72
    hin
    0.71
     ;)
    0.70
     anyways
    0.70
    !
    0.70
    here
    0.69
     anyway
    0.69
    NESS
    0.69
    !!
    0.68
    Act Density 0.028%

    No Known Activations