INDEX
Explanations
phrases related to health conditions or medical information
phrases involving people categorized by their conditions or characteristics
Negative Logits
press          -0.68
abre           -0.64
365            -0.64
roundup        -0.62
review         -0.62
println        -0.61
obi            -0.61
advertising    -0.60
Fed            -0.59
waters         -0.59
Positive Logits
stood           1.58
regard          1.21
drawn           1.14
whom            1.13
regards         1.12
standing        1.01
impunity        1.00
disabilities    0.97
respect         0.95
held            0.91
Activations Density 0.123%
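
The density figure above is most likely the share of sampled token positions on which this feature fires, although the page does not state its definition. A minimal sketch under that assumption follows; the function name, the `threshold` parameter, and the toy activation values are all hypothetical and are not taken from the dashboard.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means (number of active positions) / (total positions).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: the feature fires on 1 of 8 token positions.
toy = [0.0, 0.0, 0.0, 2.4, 0.0, 0.0, 0.0, 0.0]
density = activation_density(toy)
print(f"{density:.3%}")
```

Under this reading, a density of 0.123% would mean the feature is active on roughly 1 in every 800 sampled tokens.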