INDEX
    Explanations

    phrases indicating personal possession or relationships

    New Auto-Interp
    Negative Logits
    eniz
    -0.15
    errupted
    -0.14
    _singleton
    -0.14
    adium
    -0.14
    ceso
    -0.14
    Behaviour
    -0.14
    rud
    -0.13
    kÄĻ
    -0.13
    beans
    -0.13
    acement
    -0.13
    POSITIVE LOGITS
     projects
    0.19
    projects
    0.18
     Projects
    0.18
    Projects
    0.17
    _projects
    0.16
    auge
    0.15
    /projects
    0.15
    isos
    0.14
     products
    0.14
    clus
    0.14
    Act Density 0.017%

    No Known Activations