INDEX
    Explanations

    references to news articles or headlines

    instances of the word 'to' in various contexts

    New Auto-Interp
    Negative Logits
    gling
    -0.74
    phas
    -0.73
    cientious
    -0.72
    ssh
    -0.69
    ographically
    -0.69
    edIn
    -0.68
    Ö¼
    -0.66
    ibly
    -0.65
    mercial
    -0.65
    eson
    -0.64
    POSITIVE LOGITS
     Menu
    0.87
     Contents
    0.86
     Main
    0.85
     Top
    0.85
     Gallery
    0.84
     TOP
    0.79
     Bottom
    0.78
     PAGE
    0.77
     Table
    0.77
     grid
    0.74
    Act Density 0.042%

    No Known Activations