INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    agrid
    -0.06
    َن
    -0.06
    -0.06
    емых
    -0.06
     Equal
    -0.06
    ikt
    -0.05
     purch
    -0.05
    UserRole
    -0.05
    319
    -0.05
     COPY
    -0.05
    POSITIVE LOGITS
    koa
    0.08
    .EventHandler
    0.07
    	gl
    0.07
     sweetheart
    0.07
     protester
    0.07
    tein
    0.07
     jong
    0.06
     граждан
    0.06
    innacle
    0.06
    <|begin_of_text|>
    0.06
    Act Density 0.022%

    No Known Activations