INDEX
    Explanations

    words related to locations or events

    occurrences of the word "some"

    New Auto-Interp
    Negative Logits
    lished
    -0.81
    eries
    -0.78
    iversal
    -0.77
    olicy
    -0.71
    Downloadha
    -0.69
    rontal
    -0.69
    reddits
    -0.68
    atever
    -0.66
    lishes
    -0.65
    govtrack
    -0.64
    POSITIVE LOGITS
    ome
    1.25
    lette
    0.84
    lement
    0.81
    gran
    0.76
    ppa
    0.74
     Parenthood
    0.74
    gon
    0.73
    chron
    0.73
    olithic
    0.72
     Curve
    0.70
    Act Density 0.008%

    No Known Activations