INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     wash
    -0.08
    нь
    -0.07
     idan
    -0.07
     gland
    -0.07
    -0.07
     anl
    -0.07
    ੀਆ
    -0.07
     Ir
    -0.07
     washing
    -0.07
    _ir
    -0.07
    POSITIVE LOGITS
    Farm
    0.08
    Lib
    0.08
     pointed
    0.08
    obu
    0.07
     Manu
    0.07
    通信
    0.07
     powered
    0.07
     heating
    0.07
     powering
    0.07
     technologies
    0.07
    Act Density 0.003%

    No Known Activations