INDEX
    Explanations
    NEGATIVE LOGITS
    Token          Logit
    "oor"          -0.06
    " insults"     -0.06
    " melting"     -0.06
    "ingu"         -0.06
    ".getBounds"   -0.06
    "\tdriver"     -0.06
    " Mother"      -0.06
    " Injector"    -0.06
    " diver"       -0.06
    " چاپ"         -0.06
    POSITIVE LOGITS
    Token          Logit
    " vacations"   0.07
    "...↵↵↵↵↵↵"    0.07
    ",t"           0.07
    "947"          0.06
    "521"          0.06
    " porta"       0.06
    " iletişim"    0.06
    ":add"         0.06
                   0.06
    " dle"         0.06
    Act Density 0.000%

    No Known Activations