INDEX
Explanations
elements related to familial relationships and marital status
New Auto-Interp
Negative Logits
feitura
-0.58
+#+
-0.53
face
-0.50
correctes
-0.49
τον
-0.49
tanong
-0.49
MaterialApp
-0.48
生平
-0.47
❉
-0.46
)++;
-0.46
POSITIVE LOGITS
existing
0.87
existing
0.83
Existing
0.81
Existing
0.79
already
0.74
EXISTING
0.73
preexisting
0.72
已经有
0.71
already
0.71
previous
0.71
Activations Density 0.182%