Explanations
terms related to lessons learned or recommendations for improvement
Negative Logits
Token     Logit
nodoc     -0.65
Gore      -0.55
push      -0.50
Murdock   -0.50
هيا       -0.49
PUSH      -0.49
unesse    -0.48
corris    -0.47
Zust      -0.47
Bege      -0.47
Positive Logits
Token         Logit
lessons       1.13
Lessons       1.09
Lessons       1.06
conclusions   1.03
lesson        1.01
learnings     0.96
takeaways     0.95
lessons       0.94
leçons        0.93
Lesson        0.89
Activations Density: 0.399%