INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     domain
    -0.07
     Joined
    -0.07
    \Component
    -0.06
     Domain
    -0.06
     possession
    -0.06
    -0.06
    cdn
    -0.06
     FT
    -0.06
    เอง
    -0.06
     Setter
    -0.06
    POSITIVE LOGITS
    0.06
    _ds
    0.06
    :NSMakeRange
    0.06
    غن
    0.06
     benchmark
    0.06
    _SPR
    0.05
    TYPO
    0.05
     أل
    0.05
    كو
    0.05
    .addComponent
    0.05
    Act Density 0.002%

    No Known Activations