INDEX
Explanations
information related to health and medical procedures
New Auto-Interp
Negative Logits
wom
-0.61
stre
-0.59
Bos
-0.56
"{-0.56
builder
-0.56
arms
-0.54
PTS
-0.54
mornings
-0.51
abuser
-0.51
âĵĺ
-0.50
POSITIVE LOGITS
000
1.51
00
1.50
06
1.35
09
1.34
05
1.34
500
1.34
07
1.33
5
1.33
08
1.31
0
1.30
Activations Density 2.046%