INDEX
    Negative Logits
    " deduction"    -0.07
    " Tong"         -0.07
    " duo"          -0.07
    "\tcmd"         -0.07
    " feature"      -0.06
    "ceptive"       -0.06
    "ация"          -0.06
    " Thickness"    -0.06
    "datatype"      -0.06
    " thickness"    -0.06
    Positive Logits
    " Is"           0.09
    ".is"           0.08
    ":Is"           0.08
    "—is"           0.08
    "Is"            0.07
    "_Is"           0.07
    " freelance"    0.07
    " hâl"          0.07
    ".Is"           0.06
    "’est"          0.06
    Activation Density: 0.048%

    No Known Activations