INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    uncan
    -0.07
    DX
    -0.06
    ForeColor
    -0.06
    .schedule
    -0.06
     Courage
    -0.06
     Sugar
    -0.06
    _SHOW
    -0.06
     achie
    -0.06
     πως
    -0.06
     guns
    -0.06
    POSITIVE LOGITS
     Jupiter
    0.13
    upiter
    0.10
    0.07
    าพ
    0.07
     Pluto
    0.07
     Neptune
    0.07
    thy
    0.07
    dict
    0.06
     laundering
    0.06
     انجمن
    0.06
    Act Density 0.002%

    No Known Activations