INDEX
    Explanations

    phrases related to living situations and community contexts

    Negative Logits
    inges  -0.16
    adb  -0.16
    aterno  -0.15
    uÃŃ  -0.14
    itest  -0.14
    nk  -0.14
    æŃ£  -0.14
    ÑĢей  -0.14
    Forum  -0.14
    ̣  -0.14
    Positive Logits
     constant  0.15
    -relative  0.15
    poons  0.15
    ultip  0.15
    Ùħس  0.15
     relative  0.15
    .cfg  0.14
    ionales  0.14
    leston  0.14
    ülük  0.14
    Activation Density: 0.106%

    No Known Activations