INDEX
    Explanations

    descriptive phrases pertaining to the quality or nature of experiences or items

    New Auto-Interp
    Negative Logits
     ModelExpression
    -0.88
     itſelf
    -0.87
     Efq
    -0.82
     Majefty
    -0.82
     myſelf
    -0.76
     Chriftian
    -0.76
     himſelf
    -0.74
    ethene
    -0.73
    ruik
    -0.72
    sizeCache
    -0.70
    POSITIVE LOGITS
     easy
    0.64
     a
    0.62
     hard
    0.62
     very
    0.60
     difficult
    0.58
     time
    0.57
     simple
    0.50
     the
    0.50
     normal
    0.49
     full
    0.49
    Act Density 0.180%

    No Known Activations