INDEX
    Explanations
    Negative Logits
    Token             Logit
    " ulcer"          -0.08
    "Character"       -0.07
    "poor"            -0.07
    ".tile"           -0.07
    ".fix"            -0.07
    " Character"      -0.07
    (whitespace run)  -0.07
    "Late"            -0.07
    "Variation"       -0.07
    " want"           -0.07
    Positive Logits
    Token                    Logit
    "<|endoftext|>"          0.13
    "<|reserved_200016|>"    0.09
    " pursuits"              0.09
    "موضوع"                  0.09
    "_topic"                 0.09
    " topic"                 0.08
    "相關"                   0.08
    " entities"              0.08
    "ാരണ"                    0.08
    "(topic"                 0.08
    Activation Density: 0.624%

    No Known Activations