INDEX
    Explanations

    mentions of media events or cultural references

    New Auto-Interp
    Negative Logits
    olia
    -0.17
    olie
    -0.15
    kan
    -0.15
    à¸²à¸ł
    -0.14
    å¡
    -0.14
    еж
    -0.14
    ucas
    -0.13
    ç¼
    -0.13
    å¾ĭ
    -0.13
    ardi
    -0.13
    POSITIVE LOGITS
     bookstore
    0.41
     Books
    0.38
     Book
    0.38
     book
    0.37
     books
    0.35
    Books
    0.34
     bibli
    0.32
    Book
    0.32
     BOOK
    0.31
     книж
    0.31
    Act Density 0.115%

    No Known Activations