INDEX
Explanations
references to the treatment and well-being of individuals, particularly marginalized groups
New Auto-Interp
Negative Logits
脚注の使い方
-0.64
ScopeManager
-0.56
uLocal
-0.49
RTSC
-0.47
ComVisible
-0.42
estacks
-0.41
ویکیپدیا
-0.41
-0.41
TokenNameDOT
-0.40
Availability
-0.40
POSITIVE LOGITS
supported
1.14
cared
1.05
treated
0.95
served
0.85
assisted
0.84
catered
0.82
supported
0.81
protected
0.81
serviced
0.79
attended
0.78
Activations Density 0.496%