INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     viral
    -0.08
     произ
    -0.06
    ikal
    -0.06
    Authorize
    -0.06
    Chicago
    -0.06
    -0.06
     эффек
    -0.06
     pochop
    -0.06
    peare
    -0.06
     desn
    -0.06
    POSITIVE LOGITS
     lobbying
    0.17
     lobby
    0.14
     lobbyist
    0.12
     lobbyists
    0.12
     Lobby
    0.11
    lobby
    0.09
     düzenlem
    0.07
     Lob
    0.07
    _lb
    0.07
    0.07
    Act Density 0.002%

    No Known Activations