INDEX
    Explanations

    references to lobbying and lobbyists

    New Auto-Interp
    Negative Logits
    ume
    -0.16
    oader
    -0.15
     Templ
    -0.15
    ooke
    -0.15
    ỡ
    -0.15
    bucks
    -0.15
     Howell
    -0.15
    iber
    -0.14
    853
    -0.14
    ekyll
    -0.13
    POSITIVE LOGITS
     antenn
    0.15
    åĹ
    0.15
    atar
    0.15
    309
    0.14
    ARENT
    0.14
    665
    0.14
    staw
    0.14
    sten
    0.14
    rl
    0.14
    oru
    0.14
    Act Density 0.009%

    No Known Activations