INDEX
    Explanations

    mentions of personal knowledge or beliefs

    references to personal experiences and opinions

    New Auto-Interp
    Negative Logits
     moderation
    -0.73
     smashing
    -0.69
     masc
    -0.68
     menstrual
    -0.67
     extravag
    -0.64
     festive
    -0.64
    interstitial
    -0.63
     gren
    -0.61
     scra
    -0.61
    Interstitial
    -0.60
    POSITIVE LOGITS
     know
    1.73
    know
    1.62
     KNOW
    1.58
    Know
    1.54
     Know
    1.53
     knows
    1.50
     knew
    1.49
    knowledge
    1.35
     knowing
    1.35
     knowledge
    1.24
    Act Density 0.286%

    No Known Activations