    Explanations

    alternating series and negative numbers

    Negative Logits
     sender       0.44
     vandal       0.43
    ओसी           0.42
    седнев        0.42
     vand         0.40
     phishing     0.40
     vandalism    0.40
     flood        0.39
     sensitive    0.39
     submission   0.39
    Positive Logits
    =[-                  0.48
    (token not shown)    0.47
    negative             0.44
     नेगेटिव              0.42
     Negative            0.41
    (token not shown)    0.41
    <0xE2>               0.39
    ="-                  0.39
     $[-                 0.38
     (−                  0.38
    Act Density 0.000%

    No Known Activations