INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    	pos
    -0.07
     земля
    -0.07
    行政
    -0.07
    oultry
    -0.06
    acci
    -0.06
    етод
    -0.06
    建设
    -0.06
    ;background
    -0.06
    (subject
    -0.06
    ialect
    -0.06
    POSITIVE LOGITS
    ![
    0.07
    _fit
    0.06
    dfa
    0.06
    emos
    0.06
    .run
    0.06
     VERIFY
    0.06
    -era
    0.06
    (Runtime
    0.06
    فر
    0.06
     Tes
    0.06
    Act Density 0.001%

    No Known Activations