INDEX
    Explanations

    phrases indicating a point in time or perspective

    New Auto-Interp
    Negative Logits
     twin
    -0.68
     twins
    -0.63
    amaz
    -0.61
    harm
    -0.61
     Cities
    -0.60
    ours
    -0.59
    lez
    -0.58
    discrimination
    -0.56
     warranties
    -0.56
     subst
    -0.56
    POSITIVE LOGITS
    cture
    0.86
     onwards
    0.83
     onward
    0.77
    ebin
    0.70
    abouts
    0.70
    endment
    0.69
    ozo
    0.63
    ajor
    0.63
    gue
    0.62
    EStreamFrame
    0.61
    Act Density 0.035%

    No Known Activations