    Negative Logits
    .    -0.57
     are    -0.54
    :    -0.53
     ')[    -0.51
    "]).    -0.49
    }}}\    -0.47
    }}^{\    -0.45
    ']).    -0.44
     čty    -0.43
    -0.43
    Positive Logits
     itſelf    0.84
     it    0.81
     nakalista    0.79
    RegistryLite    0.75
    FunctionFlags    0.73
     ویکی‌پدی    0.73
    OGND    0.72
    الحياه    0.71
    WebElementEntity    0.71
     ComVisible    0.69
    Activation Density: 0.077%

    No Known Activations