    Explanations

    Time-related expressions, such as durations, specific time periods, and actions that take time to complete

    Negative Logits
    Token          Logit
    "ħĭ"           -0.70
    "Ü"            -0.66
    "eworld"       -0.65
    "ociated"      -0.63
    "edIn"         -0.62
    "rored"        -0.60
    "holm"         -0.59
    " Trafford"    -0.58
    " kindred"     -0.57
    "²"            -0.57
    Positive Logits
    Token          Logit
    " longer"      0.92
    " to"          0.90
    " before"      0.81
    " elapsed"     0.76
    " apiece"      0.75
    " for"         0.71
    " till"        0.69
    " til"         0.67
    "to"           0.66
    " (~"          0.66
    Activation Density: 0.075%

    No Known Activations