INDEX
Explanations
phrases related to medical conditions or health issues, specifically focusing on individuals with pre-existing conditions
references to people with specific conditions or characteristics
New Auto-Interp
Negative Logits
waters
-0.63
365
-0.62
press
-0.60
:-)
-0.59
fig
-0.59
uzz
-0.59
println
-0.59
review
-0.58
Fed
-0.58
obi
-0.58
POSITIVE LOGITS
stood
1.60
drawn
1.14
disabilities
1.13
regard
1.12
whom
1.11
standing
1.10
regards
1.05
impunity
1.02
holding
1.00
respect
0.98
Activations Density 0.096%