INDEX
    Explanations

    phrases indicating time periods or temporal references

    Negative Logits
    "airy"     -0.79
    "aeus"     -0.69
    "achy"     -0.67
    "cit"      -0.66
    "etric"    -0.64
    "ocus"     -0.62
    "inant"    -0.61
    "DEF"      -0.61
    "ysis"     -0.61
    "IGHT"     -0.60
    Positive Logits
    " RTX"            0.69
    " Governors"      0.64
    "bury"            0.63
    "avez"            0.62
    " checkpoints"    0.61
    " Palo"           0.60
    " Tibet"          0.60
    " Shanghai"       0.60
    " Chef"           0.59
    " 2030"           0.59
    Activation Density: 0.194%

    No Known Activations