INDEX
    Negative Logits
     alphabet    -0.07
     avid        -0.07
     paren       -0.07
     soften      -0.07
     красив      -0.07
     oid         -0.07
     conserv     -0.06
     ---         -0.06
     prefect     -0.06
     чет         -0.06
    Positive Logits
     desenv         0.11
     desarrollo     0.11
     desarroll      0.10
    <System         0.07
    .se             0.07
                    0.07
     Dale           0.07
     Campos         0.07
     Devil          0.06
     Development    0.06
    Activation Density: 0.010%

    No Known Activations