INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    768
    -0.08
     vv
    -0.07
    	startActivity
    -0.06
     Rice
    -0.06
     встре
    -0.06
     versch
    -0.06
    ільш
    -0.06
     puppies
    -0.06
     herpes
    -0.06
    irket
    -0.06
    POSITIVE LOGITS
    Bool
    0.09
    bool
    0.08
    _Bool
    0.08
    ABILITY
    0.08
     Bugs
    0.08
     boolean
    0.07
    βε
    0.07
    Л
    0.07
    baugh
    0.07
    .SetBool
    0.07
    Act Density 0.009%

    No Known Activations