INDEX
    Explanations

    phrases indicating time references or temporal relationships

    New Auto-Interp
    Negative Logits
    orks
    -0.18
    CompatActivity
    -0.17
    EEK
    -0.17
    oz
    -0.17
    riel
    -0.17
    /layouts
    -0.16
    lius
    -0.15
    tridge
    -0.15
    onis
    -0.14
    rench
    -0.14
    POSITIVE LOGITS
     after
    0.23
    after
    0.23
     posterior
    0.21
     during
    0.20
     después
    0.20
    _after
    0.19
     поÑģле
    0.19
     AFTER
    0.19
    post
    0.19
     aft
    0.19
    Act Density 0.055%

    No Known Activations