INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Levitra
    -0.08
    eks
    -0.07
    -0.07
    ================================================================
    -0.07
    riors
    -0.07
    faculty
    -0.07
    <=$
    -0.07
    asting
    -0.07
     Rotary
    -0.07
    -0.07
    POSITIVE LOGITS
     hunters
    0.08
     enumer
    0.07
     גדולה
    0.07
     falling
    0.07
     fsm
    0.07
    SetFont
    0.07
     compreh
    0.07
     גדול
    0.07
    Unmount
    0.07
     memorable
    0.07
    Act Density 0.002%

    No Known Activations