INDEX
    Explanations

    terms associated with familial relationships and friendships

    Negative Logits
    OGND            -0.63
    UserScript      -0.57
     مرئيه          -0.57
    Diweddarwch     -0.54
     تانيه          -0.53
     transfieras    -0.51
    อื่น            -0.49
     AssemblyTitle  -0.48
     autres         -0.48
     Otros          -0.47
    Positive Logits
     home           0.40
     Bible          0.40
    alapa           0.40
     Dandy          0.39
     Hiller         0.39
    对着            0.38
                    0.38
     fris           0.38
     Dome           0.38
     actual         0.37
    Activation Density: 0.064%

    No Known Activations