INDEX
    NEGATIVE LOGITS
    lla            -0.07
                   -0.07
    mill           -0.07
                   -0.06
    إن             -0.06
     NZ            -0.06
     loses         -0.06
     eux           -0.06
    óm             -0.06
    miş            -0.06
    POSITIVE LOGITS
    \Domain        0.07
    irma           0.06
    iect           0.06
     Longitude     0.06
     summit        0.06
    FXML           0.06
    .toast         0.06
    	Main          0.06
     Kensington    0.06
    =label         0.06
    Act Density 0.010%

    No Known Activations