INDEX
    Explanations

    references to the passage of time

    Negative Logits
    Token        Logit
    "addock"     -0.15
    "ucz"        -0.14
    "adium"      -0.14
    "udoku"      -0.14
    " Sad"       -0.14
    "enger"      -0.14
    " Baghd"     -0.14
    "aily"       -0.14
    "ursal"      -0.14
    "errupt"     -0.14
    Positive Logits
    Token        Logit
    " passing"   0.66
    " passage"   0.62
    " passed"    0.59
    " Passing"   0.55
    " Passage"   0.53
    " passes"    0.52
    " Passed"    0.50
    " pass"      0.47
    "Passed"     0.47
    "-pass"      0.46
    Activation Density: 0.078%

    No Known Activations