INDEX
    Negative Logits
    Token              Logit
    [token missing]    -0.09
    " tanning"         -0.08
    " booze"           -0.08
    " menopause"       -0.08
    "举报"             -0.08
    " Rotten"          -0.08
    " Romance"         -0.08
    " Scam"            -0.08
    "男性"             -0.08
    " Massage"         -0.08
    Positive Logits
    Token              Logit
    " FPGA"            0.15
    "ilinx"            0.11
    " knobs"           0.11
    " ASIC"            0.10
    " circuits"        0.10
    " tuning"          0.09
    " grids"           0.09
    " Bits"            0.09
    " hardware"        0.09
    " knob"            0.09
    Activation Density: 0.008%

    No Known Activations