INDEX
    Explanations

    references to periods of time, particularly years and months

    New Auto-Interp
    Negative Logits
    atif
    -0.17
    odus
    -0.15
    ailable
    -0.14
    ADA
    -0.14
    anus
    -0.14
    byn
    -0.14
    assis
    -0.14
     \`
    -0.14
    elman
    -0.14
     Scalars
    -0.14
    POSITIVE LOGITS
     ago
    0.53
    ago
    0.40
     Ago
    0.38
     ego
    0.28
    AGO
    0.28
     back
    0.28
     go
    0.26
     age
    0.25
     назад
    0.24
    go
    0.22
    Act Density 0.027%

    No Known Activations