INDEX
    Explanations

    instances where the phrase "the first time that" occurs

    New Auto-Interp
    Negative Logits
    aciously
    -0.74
    cosystem
    -0.64
    estern
    -0.62
    roth
    -0.62
    Guard
    -0.59
    usk
    -0.58
    leeve
    -0.58
    IDs
    -0.58
    IVERS
    -0.57
    orah
    -0.57
    POSITIVE LOGITS
     occurs
    0.93
     happens
    0.90
    soever
    0.84
     they
    0.79
     occurred
    0.79
     happened
    0.77
     arose
    0.76
     mattered
    0.76
     transpired
    0.71
     we
    0.70
    Act Density 0.123%

    No Known Activations