INDEX
    Explanations

    profanity and vulgarity

    New Auto-Interp
    Negative Logits
    Bi
    0.97
    Slice
    0.92
    जान
    0.90
    Advantages
    0.86
    Body
    0.85
    Дру
    0.83
    Height
    0.83
    Length
    0.83
     सज्
    0.82
    Lone
    0.81
    POSITIVE LOGITS
     heap
    1.27
     strewn
    1.27
    ocurrency
    1.19
    heap
    1.18
    doFilter
    1.16
     basura
    1.14
    fuck
    1.13
     garbage
    1.08
     jokes
    1.06
    ulence
    1.06
    Act Density 0.216%

    No Known Activations