INDEX
    Explanations

    words related to categories or types

    phrases that categorize or describe entities or concepts using terms like "kind" and "sort."

    New Auto-Interp
    Negative Logits
     VIDEOS
    -0.81
     tyres
    -0.71
    assies
    -0.68
     lobb
    -0.68
    obiles
    -0.67
    apses
    -0.65
     saves
    -0.64
     bolts
    -0.64
    itars
    -0.63
    rencies
    -0.63
    POSITIVE LOGITS
    worker
    0.75
    icum
    0.71
    hered
    0.71
    mate
    0.67
    edge
    0.66
    ier
    0.65
     subset
    0.65
    oser
    0.64
    bedroom
    0.64
    kit
    0.63
    Act Density 0.102%

    No Known Activations