INDEX
    Explanations
    Negative Logits
     boards       -0.07
     tort         -0.07
     fury         -0.07
    orie          -0.06
    #/            -0.06
    continuous    -0.06
    -*            -0.06
     obedient     -0.06
     polling      -0.06
    واهد          -0.06
    Positive Logits
     Qatar           0.10
    .stringify       0.07
     INDIRECT        0.06
     tbody           0.06
    ідом             0.06
    Ngoài            0.06
    Persons          0.06
    iscrimination    0.06
     agitation       0.06
    501              0.06
    Activation Density: 0.003%

    No Known Activations