INDEX
    Explanations

    references to impulsivity and its consequences

    New Auto-Interp
    Negative Logits
    bla
    -0.16
    edik
    -0.15
     Casc
    -0.14
    xy
    -0.14
    ovali
    -0.14
    Ìģc
    -0.14
    alah
    -0.14
    fo
    -0.14
    ENABLE
    -0.14
    'gc
    -0.14
    POSITIVE LOGITS
     imp
    0.15
    kami
    0.15
    onse
    0.14
    ynch
    0.14
    mux
    0.14
    iere
    0.14
    ingers
    0.14
    inen
    0.14
    antt
    0.13
    çĤī
    0.13
    Act Density 0.003%

    No Known Activations