INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    0.51
    0.51
    м
    0.48
    0.47
     année
    0.45
    ق
    0.45
    上映
    0.44
    ә
    0.44
    0.44
    ّر
    0.44
    POSITIVE LOGITS
     disruptions
    0.49
     jewel
    0.48
     fieldwork
    0.48
     undertakes
    0.47
     hypogly
    0.46
     wales
    0.46
     converts
    0.44
     تول
    0.44
     Practitioners
    0.44
     medics
    0.44
    Act Density 0.003%

    No Known Activations