INDEX
    Explanations

    comparisons of rankings or positions in a list

    phrases indicating rankings or positions in comparison to others

    New Auto-Interp
    Negative Logits
    beh
    -0.72
    Limited
    -0.67
    Register
    -0.66
    oresc
    -0.65
    leeve
    -0.64
    vari
    -0.63
    arant
    -0.63
    trial
    -0.62
    translation
    -0.62
    anu
    -0.62
    POSITIVE LOGITS
    ansas
    0.74
     rivals
    0.72
     fellow
    0.72
     Laksh
    0.71
     Norway
    0.68
     rival
    0.68
     Avatar
    0.67
     Coca
    0.65
     Saud
    0.65
     neighboring
    0.65
    Act Density 0.168%

    No Known Activations