INDEX
Explanations
phrases related to the protection and promotion of public health and community support
New Auto-Interp
Negative Logits
ppe
-0.15
Urs
-0.14
aub
-0.14
pd
-0.14
lica
-0.14
oops
-0.14
Naked
-0.14
alloca
-0.14
Guar
-0.13
Annunci
-0.13
POSITIVE LOGITS
bsub
0.18
ilet
0.15
utt
0.15
ALER
0.14
uttle
0.13
-desc
0.13
hazi
0.13
ader
0.13
oret
0.13
Ø´Ùĩر
0.13
Activations Density 0.134%