    Explanations

    Mentions of familial relationships, with a particular focus on siblings, especially brothers.

    Negative Logits
    Token            Logit
    "Население"      -0.39
    "ceptre"         -0.38
    " manifiesto"    -0.38
    " словарь"       -0.36
    " broad"         -0.36
    " Stoke"         -0.36
    " köp"           -0.35
    " Kirkwood"      -0.34
    "ätigung"        -0.34
    "MADRID"         -0.34
    Positive Logits
    Token            Logit
    "Brothers"       1.15
    " Brothers"      1.14
    " Sisters"       1.06
    " BROTHERS"      1.05
    "brothers"       1.05
    " sisters"       1.02
    "Sisters"        1.00
    "sisters"        0.98
    " Bros"          0.95
    " brothers"      0.94
    Activation Density: 0.081%
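
The density figure above can be read as roughly 81 active token positions per 100,000 sampled. The following is a minimal sketch of that arithmetic, assuming density means the fraction of sampled token positions on which the feature's activation is above zero (the source does not define it). The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and for illustration only.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of active positions; the source page
    does not state how its figure is computed.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 81 of which are active.
sample = [1.0] * 81 + [0.0] * (100_000 - 81)
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low suggests a sparse feature that fires on a narrow set of contexts, which is consistent with the tightly clustered positive-logit tokens listed above.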

    No Known Activations