INDEX
    Negative Logits
    Token             Logit
    "OOK"             -0.07
    " inert"          -0.07
    " successors"     -0.06
    " outreach"       -0.06
    "olation"         -0.06
    "-coordinate"     -0.06
    " ве"             -0.06
    " hoe"            -0.06
    " Fried"          -0.06
    "cbc"             -0.06

    Positive Logits
    Token             Logit
    " Diseases"        0.07
    "<path"            0.06
    ">Create"          0.06
    " tapered"         0.06
    "athan"            0.06
    "니다"             0.06
    " Cardio"          0.06
    "(confirm"         0.06
    " Kirby"           0.05
    " visceral"        0.05

    Act Density 0.002%

    No Known Activations