INDEX
Explanations
medical or health-related terms, especially those related to mental health or conditions requiring care and protection, along with terms related to societal issues and controversies
New Auto-Interp
Negative Logits
briefs
-0.77
OPLE
-0.72
MX
-0.67
Architecture
-0.66
azine
-0.65
phrine
-0.64
Blitz
-0.64
Dynamics
-0.63
simulations
-0.63
retreat
-0.63
POSITIVE LOGITS
assuming
1.25
balanced
1.24
ruly
1.20
readable
1.18
numbered
1.18
earned
1.15
ifying
1.15
qualified
1.14
animous
1.12
ortunately
1.12
Activations Density 2.163%