INDEX
    Explanations

    mentions of historical events or figures

    sequences of numerical values or ratings associated with content

    New Auto-Interp
    Negative Logits
    ©¶æ
    -0.89
    ulkan
    -0.75
    nesday
    -0.73
    wagen
    -0.71
    ertodd
    -0.69
    iga
    -0.65
     overth
    -0.65
     partisans
    -0.64
     <[
    -0.64
    utsche
    -0.64
    POSITIVE LOGITS
    SPONSORED
    1.00
    PHOTOS
    0.95
    Advertisement
    0.82
    AUT
    0.82
    Writing
    0.82
    é¾
    0.82
    Scroll
    0.80
    Age
    0.80
    Known
    0.77
    Their
    0.76
    Act Density 0.694%

    No Known Activations