INDEX
    Explanations

    instances of articles or posts

    Negative Logits
    Token      Logit
    "ekli"     -0.15
    "ilim"     -0.15
    "akov"     -0.15
    "WG"       -0.14
    "acht"     -0.14
    "VOKE"     -0.14
    "åħ¼"      -0.14
    "ixture"   -0.14
    "apot"     -0.13
    "achat"    -0.13
    Positive Logits
    Token        Logit
    " Previous"  0.30
    "Previous"   0.28
    " article"   0.26
    " Post"      0.26
    " Article"   0.25
    " story"     0.23
    " Entry"     0.22
    " Story"     0.21
    " post"      0.20
    " previous"  0.18
    Act Density 0.010%

    No Known Activations