INDEX
    Explanations

    large numerical values, specifically the word "thousand"

    references to the phrase "a thousand."

    Negative Logits
    odcast        -0.88
    akening       -0.88
    livious       -0.85
    enture        -0.82
    rica          -0.82
    NetMessage    -0.81
    inion         -0.79
    regon         -0.79
    untu          -0.78
    enhagen       -0.76
    Positive Logits
     oxy          0.81
     snakes       0.72
     yen          0.67
     Ake          0.65
     injection    0.64
     stripes      0.64
     lions        0.63
     Okin         0.63
     cubic        0.62
     miles        0.62
    Activation Density: 0.029%

    No Known Activations