INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    prar
    -0.16
    uldu
    -0.15
    urve
    -0.15
    ponce
    -0.15
    ileo
    -0.15
    odont
    -0.15
    edo
    -0.15
    ÅĦ
    -0.14
    gross
    -0.14
    ções
    -0.14
    POSITIVE LOGITS
    iac
    0.30
    stem
    0.23
    washing
    0.23
    storm
    0.22
    storms
    0.22
     fog
    0.21
     Fog
    0.21
    power
    0.21
    child
    0.20
    /body
    0.20
    Act Density 0.012%

    No Known Activations