INDEX
    Explanations

    questions starting with "What do you think?"

    questions that prompt discussion or opinion seeking

    New Auto-Interp
    Negative Logits
    WAYS
    -0.87
    ahime
    -0.73
    boats
    -0.70
    Interstitial
    -0.69
    boards
    -0.69
    mobi
    -0.68
    thur
    -0.67
    legram
    -0.65
    acerb
    -0.65
    fox
    -0.65
    POSITIVE LOGITS
     happen
    0.80
    ?]
    0.77
    omsday
    0.72
    actic
    0.69
    iotic
    0.66
    ederal
    0.66
    iosyncr
    0.65
     notation
    0.64
     mean
    0.64
    ?),
    0.63
    Act Density 0.051%

    No Known Activations