INDEX
    Explanations

    explicit sexual content and vulgar language

    New Auto-Interp
    Negative Logits
    bootstrapcdn
    -0.67
     său
    -0.59
    对其
    -0.58
     posiada
    -0.57
    mektedir
    -0.57
     iż
    -0.57
     pertanto
    -0.56
     således
    -0.56
     Chwiliwch
    -0.56
    将其
    -0.55
    POSITIVE LOGITS
     cuz
    0.73
     forever
    0.71
     tonight
    0.67
     til
    0.66
     wrong
    0.65
     sometimes
    0.64
     till
    0.61
     weirdly
    0.60
     outta
    0.60
     everywhere
    0.60
    Act Density 0.941%

    No Known Activations