INDEX
    Explanations

    phrases related to various news events and articles

    New Auto-Interp
    Negative Logits
    handedly
    -0.70
    naires
    -0.67
    bats
    -0.66
    gran
    -0.64
    ĪĴ
    -0.62
    knife
    -0.61
    nov
    -0.61
    ogy
    -0.60
    rams
    -0.60
    gone
    -0.59
    POSITIVE LOGITS
     WATCHED
    0.93
     Expand
    0.92
     Thumbnails
    0.88
     VIDEOS
    0.84
     Loading
    0.84
     toggle
    0.83
     caption
    0.78
     IMAGES
    0.78
     Advertisement
    0.77
    advertisement
    0.77
    Act Density 3.371%

    No Known Activations