INDEX
    Explanations

    references to educational systems and their critiques

    New Auto-Interp
    Negative Logits
    oby
    -0.16
    aska
    -0.15
    inate
    -0.15
    endar
    -0.15
    ografia
    -0.14
    adian
    -0.14
    ackers
    -0.14
    Responder
    -0.14
     Claus
    -0.14
    otte
    -0.13
    POSITIVE LOGITS
    efa
    0.16
    ãĤıãĤĮ
    0.14
     envy
    0.14
    bane
    0.14
    inema
    0.14
    crow
    0.14
    plode
    0.14
    SED
    0.14
    /layouts
    0.14
    .pk
    0.13
    Act Density 0.235%

    No Known Activations