INDEX
    Explanations

    mentions of BBC affiliations and related content

    New Auto-Interp
    Negative Logits
    utdown
    -0.17
    agen
    -0.16
    sep
    -0.14
    neau
    -0.14
    .transforms
    -0.14
    idenav
    -0.14
    BCM
    -0.14
    layan
    -0.14
    кÑĥÑĤ
    -0.13
    ablish
    -0.13
    POSITIVE LOGITS
    usercontent
    0.16
    à¸IJ
    0.15
    lectron
    0.15
    plex
    0.14
    531
    0.14
    oÅĪ
    0.14
    704
    0.14
    ropy
    0.13
    пон
    0.13
    yz
    0.13
    Act Density 0.020%

    No Known Activations