INDEX
    Explanations
    Negative Logits
                                                                                   
    -0.07
    аніт
    -0.07
    simple
    -0.06
     nr
    -0.06
    irror
    -0.06
     pers
    -0.06
    antes
    -0.06
     ge
    -0.06
    packet
    -0.06
    -0.06
    Positive Logits
    0.07
     eldest
    0.07
     Targets
    0.07
     unanswered
    0.07
     */)
    0.06
    _CLOSE
    0.06
    (GLFW
    0.06
     Everything
    0.06
     Roberts
    0.06
    Eine
    0.06
    Activation Density: 0.027%

    No Known Activations