    Explanations

    mentions of specific locations and activities involving people

    references to personal experiences and motivations in relation to political or significant life events

    Negative Logits
    " respectively"     -0.71
    "their"             -0.68
    "Their"             -0.62
    "advertising"       -0.61
    "present"           -0.60
    "inct"              -0.59
    "pieces"            -0.59
    "idi"               -0.58
    " imprint"          -0.58
    " deems"            -0.58
    Positive Logits
    " myself"           1.11
    "arij"              0.74
    " Pastebin"         0.73
    " yesterday"        0.69
    " <["               0.68
    "onnaissance"       0.67
    " intending"        0.65
    " questionnaire"    0.65
    "nesday"            0.65
    "bnb"               0.64
    Activation Density: 1.175%

    No Known Activations