INDEX
    Explanations

    proper nouns related to individuals, particularly the name "Warren" with varying strengths of activation

    mentions of the name "Warren."

    Negative Logits
    Token       Logit
    "dayName"   -0.87
    "pmwiki"    -0.78
    "lies"      -0.74
    "netflix"   -0.73
    "ãĤ´ãĥ³"    -0.71
    "liness"    -0.70
    "etheless"  -0.65
    "cific"     -0.65
    "odic"      -0.64
    "ly"        -0.64

    Positive Logits
    Token       Logit
    " Buffett"  1.41
    "sburg"     1.12
    " Farrell"  0.96
    "rade"      0.91
    " Sapp"     0.91
    " Harding"  0.89
    " Buff"     0.86
    " Warren"   0.83
    "shire"     0.77
    " Burger"   0.76
    Activation Density: 0.027%

    No Known Activations