INDEX
    Explanations

    references to lessons learned or educational takeaways

    Negative Logits
    еви
    -0.19
    kle
    -0.15
    leston
    -0.15
    hed
    -0.15
     Hedge
    -0.15
    ودی
    -0.14
    elman
    -0.14
    切り
    -0.14
    ihn
    -0.14
     hurd
    -0.13
    Positive Logits
     lessons
    0.22
     lesson
    0.19
     Lesson
    0.18
     Lessons
    0.17
     урок
    0.16
    Lesson
    0.15
    ç
    0.15
    ower
    0.15
    lesson
    0.15
     flexGrow
    0.14
    Act Density 0.072%

    No Known Activations