INDEX
    Explanations

    Math problems

    NEGATIVE LOGITS
    Token          Logit
    (blank)        -0.09
    " palm"        -0.08
    " Shelf"       -0.08
    " Wink"        -0.08
    " cạnh"        -0.08
    "चल"           -0.08
    (blank)        -0.08
    " Los"         -0.08
    (blank)        -0.08
    "elting"       -0.08
    POSITIVE LOGITS
    Token          Logit
    " mindestens"  0.10
    " almeno"      0.09
    " minimaal"    0.09
    " cone"        0.08
    " cones"       0.08
    " minstens"    0.08
    " bureaucr"    0.08
    " SAF"         0.08
    " одно"        0.08
    " barr"        0.07
    Activation Density: 0.028%

    No Known Activations