INDEX
Explanations
terms related to societal roles and responsibilities
New Auto-Interp
Negative Logits
Ñıке
-0.17
коÑĤоÑĢое
-0.16
ÙħÙĨÙĩا
-0.15
αÏħÏĦά
-0.13
à¸ļรร
-0.13
uibModal
-0.13
ãģ¯ãģªãģĦ
-0.13
mÄĽlo
-0.12
بÙĪØ§Ø¨Ø©
-0.12
ëħĦìĹIJëĬĶ
-0.12
POSITIVE LOGITS
who
1.39
whom
1.20
who
1.17
Who
0.86
whose
0.85
quien
0.85
Who
0.82
kteÅĻÃŃ
0.72
quienes
0.72
qui
0.72
Activations Density 1.589%