INDEX
Explanations
phrases that indicate insights, revelations, or the sharing of knowledge and experiences
New Auto-Interp
Negative Logits
enk
-0.16
avern
-0.15
isha
-0.15
pch
-0.15
interiors
-0.15
otu
-0.14
cord
-0.14
anz
-0.14
xon
-0.14
avig
-0.14
POSITIVE LOGITS
lessons
0.32
lesson
0.31
insights
0.29
Lesson
0.29
insight
0.28
Lesson
0.27
lessons
0.26
learn
0.26
Lessons
0.26
lesson
0.26
Activations Density 0.018%