INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Object
    -0.09
     NO
    -0.08
    śród
    -0.07
     CR
    -0.07
     artworks
    -0.07
    DOT
    -0.07
     bold
    -0.07
    quotes
    -0.07
     biodiversity
    -0.07
     бо
    -0.07
    POSITIVE LOGITS
    .presenter
    0.08
    дир
    0.08
    Initializing
    0.08
    Driven
    0.08
     estén
    0.08
    0.08
     Driven
    0.08
     preocupado
    0.08
     completing
    0.08
    ={{↵
    0.08
    Act Density 0.011%

    No Known Activations