INDEX
    Explanations
    Negative Logits
     متعلقه           -0.76
    copg              -0.71
     الحره            -0.70
     houſe            -0.69
    }\]               -0.68
    )__               -0.67
    enumi             -0.66
     itſelf           -0.65
    SequentialGroup   -0.65
    parsedMessage     -0.65
    Positive Logits
     condition        0.71
     inner            0.65
     true             0.65
     extent           0.62
     go               0.60
     scale            0.59
     status           0.57
     nature           0.57
     real             0.54
     number           0.52
    Activation Density: 0.003%

    No Known Activations