INDEX
    Explanations

    comparisons between natural and chemical substances, as well as mentions of age groups and specific gender identities

    New Auto-Interp
    Negative Logits
     fuf
    -1.67
     reluct
    -1.65
     Intere
    -1.61
     disagre
    -1.60
     depic
    -1.59
     desir
    -1.56
     increa
    -1.55
     inev
    -1.53
     emphat
    -1.53
     ?...
    -1.52
    POSITIVE LOGITS
     otherwise
    0.65
    setOpaque
    0.62
    0.61
     alike
    0.61
    بالإنجليزية
    0.60
    بالإ
    0.60
     ones
    0.60
    película
    0.59
     viewWillAppear
    0.59
     beyond
    0.59
    Act Density 0.140%

    No Known Activations