INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .ALL
    -0.07
     extreme
    -0.06
    Extreme
    -0.06
     head
    -0.06
     rushed
    -0.06
    _Code
    -0.06
    .images
    -0.06
    _Comm
    -0.06
    zone
    -0.06
    -be
    -0.06
    POSITIVE LOGITS
    _migration
    0.07
    	anim
    0.06
     должен
    0.06
    wij
    0.06
    	select
    0.06
    lesson
    0.06
    (os
    0.06
     Builder
    0.06
    	pc
    0.06
     गर
    0.06
    Act Density 0.002%

    No Known Activations