INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     commitments
    -0.08
    abes
    -0.08
     המט
    -0.08
    .thrift
    -0.08
    Child
    -0.08
    -0.08
     че
    -0.08
     dedication
    -0.08
     ба
    -0.08
     tat
    -0.08
    POSITIVE LOGITS
    0.09
     dielectric
    0.08
    लेक्ट्र
    0.08
    禁止
    0.08
     repell
    0.08
     ceramics
    0.08
     ceramic
    0.07
    /pass
    0.07
     compris
    0.07
     impede
    0.07
    Act Density 0.005%

    No Known Activations