INDEX
    Explanations

    numerical figures, particularly ones related to items for sale

    numerical identifiers or rankings, likely related to specific entities or categories

    New Auto-Interp
    Negative Logits
    gerald
    -0.80
    manship
    -0.63
     "$:/
    -0.60
    enegger
    -0.58
    ural
    -0.56
    rolet
    -0.56
     Ik
    -0.56
    utan
    -0.55
     sway
    -0.55
    ured
    -0.54
    POSITIVE LOGITS
    nd
    2.09
    ND
    1.19
    133
    1.09
    160
    1.09
    147
    1.07
    245
    0.98
    187
    0.98
     externalToEVAOnly
    0.94
     thirds
    0.94
    155
    0.93
    Act Density 0.123%

    No Known Activations