INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    larg
    -0.06
     Mercury
    -0.06
     Formal
    -0.06
    寿
    -0.06
     Seeder
    -0.06
    -0.06
     Wear
    -0.06
     telephone
    -0.06
     پیشنهاد
    -0.06
    gin
    -0.06
    POSITIVE LOGITS
     ReadOnly
    0.07
    	inst
    0.07
    _embedding
    0.07
    วไป
    0.07
    converted
    0.07
     veloc
    0.07
    .check
    0.06
     พล
    0.06
    .lock
    0.06
    -related
    0.06
    Act Density 0.030%

    No Known Activations