INDEX
    Explanations

    polynomial roots/zeros

    New Auto-Interp
    Negative Logits
     expir
    -0.08
     gus
    -0.08
     Prag
    -0.07
    -0.07
     gay
    -0.07
     Spa
    -0.07
     COPD
    -0.07
     spacious
    -0.07
     Lotus
    -0.07
    'annonce
    -0.07
    POSITIVE LOGITS
    adia
    0.08
    dac
    0.08
     świat
    0.07
     Control
    0.07
    uropa
    0.07
    0.07
     світ
    0.07
     ::
    0.07
     Alter
    0.07
    _control
    0.07
    Act Density 0.019%

    No Known Activations