INDEX
Explanations
references to emotional responses to medical diagnoses
New Auto-Interp
Negative Logits
bih
-0.15
adar
-0.14
Motion
-0.14
idon
-0.14
iros
-0.14
uddy
-0.14
arger
-0.13
Ñĸ
-0.13
rip
-0.13
anner
-0.13
POSITIVE LOGITS
urance
0.15
ressive
0.15
ÑĤоÑĤ
0.14
apat
0.14
-floating
0.14
عاد
0.13
ạn
0.13
enden
0.13
DataExchange
0.13
tempts
0.13
Activations Density 0.324%