INDEX
    Explanations

    references to specific individuals and their claims related to age discrimination and cancer

    New Auto-Interp
    Negative Logits
    inker
    -0.15
    зна
    -0.14
    arken
    -0.14
    appiness
    -0.13
    itud
    -0.13
    imiter
    -0.13
     census
    -0.13
     Mahm
    -0.13
    ROP
    -0.13
     architect
    -0.13
    POSITIVE LOGITS
     meta
    0.35
    meta
    0.30
     Meta
    0.30
    Meta
    0.30
    .meta
    0.28
    /meta
    0.26
    -meta
    0.26
    .Meta
    0.24
    _meta
    0.23
    META
    0.22
    Act Density 0.018%

    No Known Activations