INDEX
    Explanations

    comparisons of quantities or actions

    phrases that express comparisons or differences in quantities or actions

    Negative Logits
     Katy           -0.63
     Miko           -0.62
     commencement   -0.62
    susp            -0.62
     Wear           -0.61
     Griffin        -0.57
    ike             -0.56
     Mou            -0.56
    Draft           -0.56
     Christy        -0.56
    Positive Logits
    liest           0.71
    pees            0.67
    Downloadha      0.66
    ibaba           0.66
     traditionally  0.65
    hattan          0.65
    20439           0.65
    athom           0.64
     sidx           0.63
     herself        0.63
    Act Density 0.164%

    No Known Activations