INDEX
Explanations
phrases related to expressing concerns or worries
mentions of concerns and issues related to health and safety
Negative Logits
Doodle        -0.81
precincts     -0.78
dances        -0.76
aunder        -0.73
heid          -0.73
azon          -0.72
OUP           -0.70
equivalents   -0.70
å§            -0.70
haw           -0.66
Positive Logits
undue            0.80
cens             0.78
severe           0.77
excessive        0.74
Patients         0.70
prolonged        0.69
premature        0.68
preventing       0.67
misinformation   0.67
heightened       0.66
Activations Density 0.151%