INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     bail
    -0.09
    -0.09
    हा
    -0.08
     bif
    -0.08
    .sun
    -0.07
     tätig
    -0.07
     Trans
    -0.07
    -operated
    -0.07
     provision
    -0.07
    [m
    -0.07
    POSITIVE LOGITS
     asylum
    0.09
     شهاد
    0.08
    ći
    0.08
     gemeent
    0.08
    airie
    0.08
     gegense
    0.08
    rið
    0.08
     hồ
    0.08
     dham
    0.08
    uganda
    0.08
    Act Density 0.002%

    No Known Activations