INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     laufen
    -0.09
     ض
    -0.08
     ovs
    -0.08
     émer
    -0.08
     settled
    -0.08
     이동
    -0.08
    kommer
    -0.08
    -0.08
    hiba
    -0.07
    foot
    -0.07
    POSITIVE LOGITS
    -rich
    0.08
     flare
    0.08
     intake
    0.08
     antioxidants
    0.08
     antise
    0.07
     plaintext
    0.07
    117
    0.07
     রিপোর্ট
    0.07
     supplementation
    0.07
    FI
    0.07
    Act Density 0.002%

    No Known Activations