INDEX
    Explanations

    math problems

    New Auto-Interp
    Negative Logits
     guided
    -0.08
    Extract
    -0.08
     guider
    -0.08
    Apart
    -0.08
    Bearing
    -0.08
     locations
    -0.07
     buchen
    -0.07
     guides
    -0.07
    Guide
    -0.07
    Top
    -0.07
    POSITIVE LOGITS
    美元
    0.09
    評価
    0.09
     નાના
    0.08
    0.08
     dizendo
    0.08
     olevan
    0.08
     jovem
    0.08
     باعث
    0.08
     pervers
    0.08
     ખેલ
    0.08
    Act Density 0.070%

    No Known Activations