INDEX
    Explanations

    tube, tubes, diameter, rack

    New Auto-Interp
    Negative Logits
    ни
    1.54
    1.15
    ння
    1.09
    1.07
    ین
    1.05
    time
    1.05
    st
    1.04
    1.04
    ק
    1.04
    daughter
    1.02
    POSITIVE LOGITS
    1.57
     tubes
    1.56
    1.42
    1.30
    1.27
     tube
    1.23
    一些
    1.22
    1.20
     Tubes
    1.18
    وم
    1.10
    Act Density 0.002%

    No Known Activations