INDEX
    Explanations

    references to familial relationships, particularly focusing on spouses and partners

    New Auto-Interp
    Negative Logits
    帖最后由
    -0.70
    iprot
    -0.62
     Kriege
    -0.60
    junge
    -0.59
     sqlSession
    -0.59
     GTX
    -0.58
     barbers
    -0.57
     Canaria
    -0.57
     Stiles
    -0.57
    حياته
    -0.57
    POSITIVE LOGITS
     sector
    0.57
     gerekiyor
    0.56
    herst
    0.54
    шила
    0.52
    dema
    0.52
     verla
    0.51
    sector
    0.50
    Sector
    0.48
     Sector
    0.48
     архивлан
    0.48
    Act Density 0.028%

    No Known Activations