INDEX
Explanations
terms related to familial relationships
New Auto-Interp
Negative Logits
rades
-0.16
asse
-0.15
urga
-0.14
ÙĪØ±Ùĩ
-0.14
aso
-0.13
emales
-0.13
Jensen
-0.13
uder
-0.13
rup
-0.13
ãģ«éĸ¢ãģĻãĤĭ
-0.13
POSITIVE LOGITS
of
0.49
cá»§a
0.35
of
0.28
à¸Ĥà¸Ńà¸ĩà¸ľ
0.27
à¸Ĥà¸Ńà¸ĩ
0.27
าà¸Ĥà¸Ńà¸ĩ
0.21
à¸Ĥà¸Ńà¸ĩ
0.21
_of
0.21
à¹Įà¸Ĥà¸Ńà¸ĩ
0.21
à¸Ĥà¸Ńà¸ĩร
0.20
Activations Density 0.059%