INDEX
    Explanations

    references to second place or ranking

    occurrences of the word "second."

    New Auto-Interp
    Negative Logits
    LOD
    -0.72
    aceae
    -0.71
    embed
    -0.64
    nuts
    -0.63
    avid
    -0.62
    HUD
    -0.61
    ains
    -0.60
    xual
    -0.59
    Features
    -0.58
    quit
    -0.57
    POSITIVE LOGITS
     second
    3.29
    second
    2.62
     third
    2.40
     fourth
    2.35
    Second
    2.22
     Second
    2.11
     fifth
    2.07
     sixth
    2.04
     seventh
    1.97
     secondly
    1.93
    Act Density 0.030%

    No Known Activations