Explanations
references to individuals dealing with health-related issues
Negative Logits
ViewFeatures       -0.69
AssemblyCulture    -0.65
MessageOf          -0.61
مشين               -0.57
ThroughAttribute   -0.55
intercession       -0.53
:][                -0.52
ModelExpression    -0.52
delwed             -0.51
(no token text)    -0.51
Positive Logits
who        0.97
who        0.81
whoſe      0.79
ktorí      0.72
quienes    0.70
whofe      0.69
kteří      0.69
Whose      0.68
którzy     0.67
Whose      0.65
Activations Density 0.320%