INDEX
    Explanations

    references to kinship or family relationships

    New Auto-Interp
    Negative Logits
    ')){
    -0.57
    ()]);
    -0.55
    ']);
    
    -0.55
    "]);
    
    -0.54
    ']){
    -0.53
    ")){
    
    -0.52
    '])){
    
    -0.50
    '}>
    -0.50
    ()}}
    -0.49
     }));
    -0.49
    POSITIVE LOGITS
     kin
    0.71
     betweenstory
    0.69
     KIN
    0.59
    Kin
    0.58
     ки
    0.57
     Kin
    0.57
     Kib
    0.56
     kim
    0.56
     ki
    0.54
    
    0.54
    Act Density 1.558%

    No Known Activations