INDEX
    Explanations

    phrases indicating familiarity or knowledge with a subject

    references to familiarity or knowledge of concepts, entities, or experiences

    New Auto-Interp
    Negative Logits
    ishers
    -0.74
    cers
    -0.72
    eters
    -0.68
    gencies
    -0.68
    itter
    -0.68
    aredevil
    -0.68
    isher
    -0.65
    isco
    -0.63
    anmar
    -0.63
    pex
    -0.63
    POSITIVE LOGITS
    lege
    1.04
     firsthand
    0.91
    ledge
    0.90
     anecd
    0.79
     intimately
    0.73
     existed
    0.69
     exists
    0.68
    terday
    0.65
     instinctively
    0.63
    âĦ¢:
    0.63
    Act Density 0.056%

    No Known Activations