INDEX
    Explanations
    Negative Logits
     Pier        -0.09
     Sexual      -0.08
     pinnacle    -0.08
     sexual      -0.08
     spray       -0.07
                 -0.07
     imprim      -0.07
     ചോദ         -0.07
     இன          -0.07
     Spray       -0.07
    Positive Logits
     divert      0.08
    alama        0.08
     العص        0.08
     magistr     0.07
     Blackburn   0.07
    reiro        0.07
    τσι          0.07
    banken       0.07
     Saver       0.07
     Ans         0.07
    Act Density 0.032%

    No Known Activations