INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    inker
    -0.07
    Tokenizer
    -0.07
     числі
    -0.06
    íte
    -0.06
    kor
    -0.06
    ahir
    -0.06
    ningen
    -0.06
    urement
    -0.06
    RH
    -0.06
    (",
    -0.06
    POSITIVE LOGITS
    .chdir
    0.07
    0.07
    	nil
    0.06
    =s
    0.06
     Blanch
    0.06
    Continue
    0.06
     Estr
    0.06
    outil
    0.06
    .Step
    0.06
    ัวเอง
    0.06
    Act Density 0.000%

    No Known Activations