INDEX
Explanations
references to care facilities and their activities
Negative Logits
Disposition    -0.16
Rav            -0.14
pardon         -0.14
EDI            -0.14
ustain         -0.14
saturated      -0.14
sideline       -0.13
λε             -0.13
Master         -0.13
Vuex           -0.13
Positive Logits
care           0.27
Care           0.27
dementia       0.24
cared          0.22
domic          0.20
Care           0.20
Supported      0.20
care           0.20
supported      0.19
supported      0.19
Activations Density: 0.019%