INDEX
    Explanations

    the presence of the word "pl" in various forms

    New Auto-Interp
    Negative Logits
    PhysRev
    -0.70
     Hül
    -0.62
    umns
    -0.60
     SEDS
    -0.59
     Segel
    -0.56
    jiny
    -0.55
    -0.55
    يكب
    -0.54
    ItemBackground
    -0.53
    Referências
    -0.53
    POSITIVE LOGITS
     Pl
    2.87
     pl
    2.81
    pl
    2.79
    Pl
    2.79
     PL
    2.75
     pla
    2.73
    PL
    2.68
     Pla
    2.65
    pla
    2.49
    Pla
    2.37
    Act Density 0.096%

    No Known Activations