INDEX
    Explanations

    references to "Ward" and "wardrobe" indicating contexts related to caregiving and personal space

    New Auto-Interp
    Negative Logits
    apolis
    -0.08
    ÏĤ
    -0.08
    ctors
    -0.07
    alted
    -0.07
    unci
    -0.07
    ittel
    -0.07
    curity
    -0.07
    ëħIJ
    -0.07
    ollen
    -0.07
    ordin
    -0.07
    POSITIVE LOGITS
    robe
    0.11
    ship
    0.07
    abouts
    0.06
    roots
    0.06
     bud
    0.06
     
    0.06
    uman
    0.06
    low
    0.06
    craft
    0.05
    ign
    0.05
    Act Density 0.009%

    No Known Activations