INDEX
    Explanations

    phrases that indicate insights, revelations, or the sharing of knowledge and experiences

    New Auto-Interp
    Negative Logits
    enk
    -0.16
    avern
    -0.15
    isha
    -0.15
    pch
    -0.15
     interiors
    -0.15
    otu
    -0.14
    cord
    -0.14
    anz
    -0.14
    xon
    -0.14
    avig
    -0.14
    POSITIVE LOGITS
     lessons
    0.32
     lesson
    0.31
     insights
    0.29
     Lesson
    0.29
     insight
    0.28
    Lesson
    0.27
    lessons
    0.26
     learn
    0.26
     Lessons
    0.26
    lesson
    0.26
    Act Density 0.018%

    No Known Activations