INDEX
    Explanations

    names, titles, or labels starting with "Prim"

    references to a specific term, "Prim," and its variations in context

    New Auto-Interp
    Negative Logits
    lihood
    -0.75
    IGH
    -0.73
    人
    -0.67
     Hanson
    -0.65
    jriwal
    -0.65
    kson
    -0.64
    hiro
    -0.63
    hold
    -0.61
     Bard
    -0.61
     Cheong
    -0.59
    POSITIVE LOGITS
    itives
    1.53
    ordial
    1.43
    itive
    1.42
    etime
    1.28
    arily
    1.23
    rose
    1.18
    acy
    1.07
    aries
    1.01
    itiveness
    0.99
    ework
    0.92
    Act Density 0.025%

    No Known Activations