INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    COP
    -0.08
     réunions
    -0.08
    -0.08
    clature
    -0.08
     Dmit
    -0.08
    .refs
    -0.08
    stime
    -0.08
     фиг
    -0.07
    fter
    -0.07
     overloaded
    -0.07
    POSITIVE LOGITS
     eligibility
    0.09
     eligible
    0.09
    eligible
    0.08
     Eligibility
    0.08
    elig
    0.08
    Eligible
    0.08
    Eligibility
    0.08
    aroo
    0.08
     Eligible
    0.08
     EP
    0.08
    Act Density 0.001%

    No Known Activations