INDEX
    Explanations

    references to specific articles or products within the text

    Negative Logits
    Token        Logit
    "aus"        -0.81
    "bats"       -0.72
    "nings"      -0.69
    "nown"       -0.69
    "Ĭ±"         -0.68
    " Izan"      -0.66
    " Palest"    -0.64
    " Mund"      -0.62
    " Gavin"     -0.62
    "ornings"    -0.61
    Positive Logits
    Token            Logit
    " article"       0.96
    " item"          0.91
    " ARTICLE"       0.87
    " topic"         0.86
    " slideshow"     0.85
    " particular"    0.84
    " repository"    0.82
    " wiki"          0.82
    " trope"         0.81
    " addon"         0.79
    Activation Density: 0.060%

    No Known Activations