INDEX
    Explanations

    references to additional or supplementary information

    phrases indicating ongoing political discourse and controversies

    New Auto-Interp
    Negative Logits
    sie
    -0.75
    arial
    -0.71
    tera
    -0.70
    reen
    -0.66
    ãĥĥãĥĪ
    -0.65
    ãĤ©
    -0.61
    orescence
    -0.61
    Border
    -0.61
    idem
    -0.60
    MIT
    -0.60
    POSITIVE LOGITS
     VIDEOS
    0.97
    ONSORED
    0.91
    ĸļ
    0.81
    enegger
    0.80
    osponsors
    0.79
    EStream
    0.77
    ellen
    0.74
    bilt
    0.70
    20439
    0.66
     MORE
    0.65
    Act Density 0.019%

    No Known Activations