INDEX
    Explanations

    phrases indicating lack of association or relevance

    phrases emphasizing the lack of relevance or connection

    New Auto-Interp
    Negative Logits
    ©¶æ¥µ
    -0.88
    kept
    -0.74
    psons
    -0.72
     polled
    -0.70
    hiba
    -0.68
    marg
    -0.68
    umper
    -0.67
    ³³³³³³³³
    -0.67
    fal
    -0.66
    warn
    -0.66
    POSITIVE LOGITS
     regard
    0.69
     determining
    0.68
     what
    0.65
     deciding
    0.65
     fixing
    0.65
     selecting
    0.62
     destiny
    0.61
     regards
    0.61
     respecting
    0.61
     designing
    0.60
    Act Density 0.036%

    No Known Activations