    Negative Logits
    Token       Logit
    " яка"      -0.07
    " most"     -0.06
    " attain"   -0.06
    " цієї"     -0.06
    "NY"        -0.06
    "o"         -0.06
    "COVID"     -0.06
    "udies"     -0.06
    " side"     -0.06
    "WOOD"      -0.06
    Positive Logits
    Token              Logit
    " Element"         0.08
    " element"         0.07
    "Critical"         0.07
    "\telem"           0.07
    ".element"         0.07
    [no token shown]   0.07
    " sàn"             0.06
    " glBegin"         0.06
    " elements"        0.06
    " сим"             0.06
    Activation Density: 0.005%

    No Known Activations