INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.07
     peur
    -0.07
     pega
    -0.07
     fearful
    -0.07
     junta
    -0.07
     голову
    -0.07
    tax
    -0.07
    ");//
    -0.07
     சே
    -0.07
    .business
    -0.07
    POSITIVE LOGITS
     Em
    0.09
     Enjoy
    0.08
    <|endoftext|>
    0.08
    ً
    0.08
     Act
    0.07
    Enjoy
    0.07
     Закон
    0.07
    <|reserved_200016|>
    0.07
     Butter
    0.07
    ↵//↵//
    0.07
    Act Density 0.009%

    No Known Activations