INDEX
    Explanations

    phrases related to news headlines

    instances of dashes or other symbols indicating breaks in text

    New Auto-Interp
    Negative Logits
    wagen
    -0.79
    spir
    -0.65
    deals
    -0.64
    interstitial
    -0.64
     servicing
    -0.63
    bour
    -0.62
    stones
    -0.62
    agate
    -0.61
    heric
    -0.61
    range
    -0.61
    POSITIVE LOGITS
     Comments
    0.86
     Advertisement
    0.81
    =-=-=-=-
    0.73
    ––
    0.73
     Transcript
    0.71
    ADVERTISEMENT
    0.70
     Fever
    0.69
    =-=-=-=-=-=-=-=-
    0.68
    Edited
    0.68
    ĸļ
    0.66
    Act Density 0.049%

    No Known Activations