INDEX
    Explanations

    mentions of interactions with a specific group of people, such as colleagues or associates

    references to groups of people, particularly those labeled as "fellow."

    New Auto-Interp
    Negative Logits
    uilt
    -0.69
     livest
    -0.62
    _-
    -0.61
     Sue
    -0.60
    creen
    -0.60
    ussy
    -0.60
    âĵĺ
    -0.59
    anwhile
    -0.59
    onics
    -0.59
    itton
    -0.59
    POSITIVE LOGITS
     traveler
    1.22
     travelers
    1.17
     travellers
    1.06
     strugg
    1.05
    worldly
    1.01
     traveller
    0.95
     classmates
    0.92
    workers
    0.79
    worker
    0.79
     inmate
    0.78
    Act Density 0.071%

    No Known Activations