Explanations
phrases related to health care access and political engagement
Negative Logits
chia      -0.15
zin       -0.15
ìĦľëĬĶ    -0.15
âĪı       -0.14
ifar      -0.14
Shut      -0.14
ammen     -0.13
Flynn     -0.13
Äĥm       -0.13
conv      -0.13
Positive Logits
to        0.19
ãĢįãĤĴ    0.16
_Tis      0.15
να        0.15
à¥Įद      0.14
(*((      0.14
ufs       0.14
IALIZED   0.14
us        0.13
aby       0.13
Activations Density: 0.402%