INDEX
    Explanations

    references to dates or specific time identifiers

    Negative Logits
    ted       -0.18
    led       -0.17
    o         -0.17
    LED       -0.17
    oine      -0.15
    rias      -0.15
    orious    -0.15
    eel       -0.15
    annes     -0.14
    lant      -0.14
    Positive Logits
    roe       0.23
    astery    0.22
    ochrome   0.20
    itored    0.20
    ero       0.19
    soon      0.19
     mon      0.18
    sters     0.18
    (mon      0.18
    tréal     0.17
    Act Density 0.016%

    No Known Activations