INDEX
    Explanations

    key lessons or insights mentioned in a text

    phrases that refer to lessons learned or teachings

    New Auto-Interp
    Negative Logits
     occupancy
    -0.74
    omin
    -0.71
    trak
    -0.68
    umbers
    -0.64
    FP
    -0.64
    BLIC
    -0.62
    chairs
    -0.62
    hw
    -0.62
    endars
    -0.62
    ãĥ¢
    -0.61
    POSITIVE LOGITS
     Learned
    1.55
     learned
    1.50
     learnt
    1.42
     lesson
    1.24
     lessons
    1.23
    Lear
    1.12
    learn
    1.10
     taught
    1.03
     glean
    0.93
     Lessons
    0.89
    Act Density 0.051%

    No Known Activations