INDEX
    Explanations

    instances of subscription and newsletter-related terms

    New Auto-Interp
    Negative Logits
    vap
    -0.18
    anced
    -0.15
    lica
    -0.15
    .ws
    -0.15
    upp
    -0.14
    лиÑĨ
    -0.14
    εί
    -0.14
    hung
    -0.14
    ngine
    -0.13
    ège
    -0.13
    POSITIVE LOGITS
    orrh
    0.15
    angan
    0.14
     kav
    0.14
    IFS
    0.13
     Mormon
    0.13
    strain
    0.13
     /*!<
    0.13
     Erotik
    0.13
     Markus
    0.13
     Jane
    0.13
    Act Density 0.012%

    No Known Activations