    Explanations

    phrases with the word "supposed" followed by a verb

    references to expectations or requirements regarding actions or situations

    Negative Logits
    "croft"         -0.79
    "lake"          -0.71
    "isks"          -0.71
    "cloth"         -0.69
    "hold"          -0.69
    "estern"        -0.69
    "bu"            -0.68
    "lust"          -0.68
    "detail"        -0.67
    "Charge"        -0.66

    Positive Logits
    " explan"       0.90
    "DonaldTrump"   0.88
    " mathemat"     0.84
    " mosqu"        0.83
    " conflic"      0.79
    " nodd"         0.76
    " undermin"     0.75
    " supposed"     0.74
    " compe"        0.71
    " millenn"      0.70

    Act Density 0.007%

    No Known Activations