INDEX
Explanations
references to kinship or family relationships
New Auto-Interp
Negative Logits
')){-0.57
()]);
-0.55
']);
-0.55
"]);
-0.54
']){-0.53
")){
-0.52
'])){
-0.50
'}>
-0.50
()}}
-0.49
}));
-0.49
POSITIVE LOGITS
kin
0.71
betweenstory
0.69
KIN
0.59
Kin
0.58
ки
0.57
Kin
0.57
Kib
0.56
kim
0.56
ki
0.54
0.54
Activations Density 1.558%