INDEX
    Explanations

    references to specific numerical data, particularly in relation to news articles and events

    New Auto-Interp
    Negative Logits
    aters
    -0.17
    iglia
    -0.16
    led
    -0.16
    ÏħÏĦÏĮ
    -0.15
    iggs
    -0.15
    ly
    -0.15
    lero
    -0.15
    igel
    -0.14
    tml
    -0.14
    uent
    -0.14
    POSITIVE LOGITS
    veh
    0.32
    jab
    0.22
    pline
    0.17
    nicos
    0.16
    анÑĤи
    0.15
    venes
    0.15
    éĥİ
    0.15
    erez
    0.15
    ’te
    0.15
    enberg
    0.15
    Act Density 0.080%

    No Known Activations