INDEX
    Explanations

    phrases that refer to inclusivity and the presence of multiple elements or features

    New Auto-Interp
    Negative Logits
    and
    -0.54
     scot
    -0.49
    that
    -0.49
    C
    -0.48
    tro
    -0.47
    ally
    -0.47
    which
    -0.46
    baomidou
    -0.46
    retario
    -0.45
    instead
    -0.45
    POSITIVE LOGITS
     includes
    2.82
     include
    2.49
     Includes
    2.47
    Includes
    2.34
    includes
    2.30
     INCLUDES
    2.10
     inclui
    1.99
     Include
    1.99
     incluye
    1.99
     INCLUDE
    1.81
    Act Density 0.171%

    No Known Activations