INDEX
    Explanations

    punctuation

    New Auto-Interp
    Negative Logits
     arrange
    -0.07
    -0.06
    Right
    -0.06
    举起
    -0.06
    -0.06
     occurrences
    -0.06
    -thirds
    -0.06
    Ask
    -0.06
    זו
    -0.06
     summ
    -0.06
    POSITIVE LOGITS
    田园
    0.08
    /ref
    0.07
    _RECORD
    0.07
     ancest
    0.07
    0.07
    0.07
    0.07
    图画
    0.07
    DCALL
    0.07
     בג
    0.07
    Act Density 0.048%

    No Known Activations