INDEX
Explanations
references to vaccines and discussions related to public health policies
New Auto-Interp
Negative Logits
jen
-0.15
majority
-0.14
kin
-0.14
anywhere
-0.13
goodwill
-0.13
slightest
-0.13
éĢ
-0.13
eras
-0.13
548
-0.13
uter
-0.12
POSITIVE LOGITS
topic
0.38
question
0.35
possibility
0.32
issue
0.32
role
0.31
prospects
0.31
topic
0.31
subject
0.30
effect
0.28
fate
0.28
Activations Density 0.225%