INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .math
    -0.07
    -0.07
    -feed
    -0.07
     celebrated
    -0.07
    ino
    -0.07
     روند
    -0.06
    .frames
    -0.06
     midpoint
    -0.06
    -0.06
    stood
    -0.06
    POSITIVE LOGITS
     disgusting
    0.07
     USB
    0.06
     coch
    0.06
    Prog
    0.06
    conv
    0.06
    InlineData
    0.05
     HOR
    0.05
     uranus
    0.05
     nhựa
    0.05
     originating
    0.05
    Act Density 0.003%

    No Known Activations