INDEX
    Explanations

    information

    Negative Logits
     officiel    -0.08
    laf          -0.08
     insanely    -0.08
     Sic         -0.07
     verhind     -0.07
     Hunting     -0.07
     nationals   -0.07
    cito         -0.07
     Howe        -0.07
     zie         -0.07
    Positive Logits
     consenting  0.08
    NEWS         0.07
    askar        0.07
     consent     0.07
                 0.07
     discern     0.07
     dimin       0.06
     $           0.06
    onton        0.06
     None        0.06
    Act Density 0.007%

    No Known Activations