INDEX
    Explanations

    detailed explanations or descriptions within text

    the term "description" and its variations

    Negative Logits
    "nuts"             -0.79
    "ced"              -0.69
    "yrus"             -0.68
    "abb"              -0.67
    "cot"              -0.66
    "enthal"           -0.66
    "acus"             -0.66
    "inth"             -0.65
    "ergic"            -0.64
    "lah"              -0.62

    Positive Logits
    " descriptions"     1.04
    " description"      1.03
    "REDACTED"          0.86
    "description"       0.83
    " synopsis"         0.83
    " describ"          0.82
    "anguage"           0.82
    " thereof"          0.77
    " specifications"   0.77
    "Description"       0.75

    Activation Density: 0.014%

    No Known Activations