INDEX
    Explanations

    references to people and their actions or states of being

    New Auto-Interp
    Negative Logits
    wheel
    -0.15
    nila
    -0.15
    .mit
    -0.14
    еÑģа
    -0.14
    ERC
    -0.14
     Wheel
    -0.14
     wheel
    -0.14
    avo
    -0.14
    ifle
    -0.14
    onder
    -0.14
    POSITIVE LOGITS
    inx
    0.17
     erotiske
    0.15
    orz
    0.15
     nakne
    0.15
     pornofilm
    0.15
    otti
    0.14
     اÙĦرسÙħÙĬ
    0.14
    twig
    0.14
    èħ¹
    0.14
    coni
    0.13
    Act Density 0.182%

    No Known Activations