INDEX
    Explanations
    NEGATIVE LOGITS
    Token           Logit
    "/year"         -0.07
                    -0.06
    "Du"            -0.06
    "=r"            -0.06
    " níž"          -0.06
    " новых"        -0.06
    " pressures"    -0.06
                    -0.06
    " HIGH"         -0.06
    " whe"          -0.06
    POSITIVE LOGITS
    Token           Logit
    "hist"           0.07
    "lua"            0.06
    ".family"        0.06
    " hätte"         0.06
    " arte"          0.06
    " pol"           0.06
    " Lionel"        0.06
    "perc"           0.06
    "PLAIN"          0.06
    "twitter"        0.06
    Activation Density: 0.004%

    No Known Activations