INDEX
    Explanations

    mentions of the word "book."

    New Auto-Interp
    Negative Logits
    ilitary
    -0.75
     Bots
    -0.72
    sembly
    -0.68
    xon
    -0.67
     distant
    -0.65
     twitch
    -0.64
    cffff
    -0.64
     Yin
    -0.64
     Lumpur
    -0.64
     VIDEOS
    -0.62
    POSITIVE LOGITS
    stores
    1.49
    seller
    1.27
    marks
    1.27
    shop
    1.14
    marked
    1.11
    cases
    1.09
    worms
    1.08
    worm
    1.05
    book
    1.03
    keeping
    1.02
    Act Density 0.038%

    No Known Activations