INDEX
Explanations
references to health metrics, specifically those involving evaluations and scales of well-being
Negative Logits
Autoritní  -0.59
makeConstraints  -0.50
lẻ  -0.49
gogo  -0.43
klo  -0.43
CURIAM  -0.43
שוליים  -0.42
exitRule  -0.42
ropol  -0.42
λε  -0.41
Positive Logits
UnsafeEnabled  0.80
Мексичка  0.76
scale  0.71
rating  0.66
skala  0.63
насељу  0.63
ratings  0.62
scaled  0.62
Савезне  0.61
척  0.59
Activations Density 0.490%