    Negative Logits
     repeats
    -0.07
    ‌کننده
    -0.07
    .GREEN
    -0.07
     انتشار
    -0.06
     للم
    -0.06
     имеют
    -0.06
    ικοί
    -0.06
    ماری
    -0.06
    fib
    -0.06
    /includes
    -0.06
    Positive Logits
    .Create
    0.06
    vrolet
    0.06
     Swamp
    0.06
     slave
    0.06
    0.06
     Disclaimer
    0.06
     leds
    0.06
     BTS
    0.06
    wolf
    0.06
     PHI
    0.06
    Activation Density: 0.051%

    No Known Activations