INDEX
Explanations
historical or insightful lessons gleaned from various contexts
references to lessons and learning experiences
Negative Logits
Token       Logit
omin        -0.72
yy          -0.72
trak        -0.67
urat        -0.63
uve         -0.63
ãĥ¢         -0.63
gur         -0.63
occupancy   -0.62
yip         -0.61
ãĥı         -0.60
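Two of the negative-logit tokens above (`ãĥ¢` and `ãĥı`) look like encoding debris, but they are consistent with how GPT-2-style byte-level BPE vocabularies display raw bytes. The sketch below assumes that convention (the page does not say which tokenizer is in use) and inverts the byte-to-character table to recover the underlying UTF-8 text. The helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of the page or of any particular library.

```python
def bytes_to_unicode() -> dict[int, str]:
    """Build the GPT-2-style table mapping each byte value to a printable character.

    Assumption: the vocabulary uses the GPT-2 byte-level convention.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    # Printable bytes map to themselves.
    mapping = {b: chr(b) for b in printable}
    # Remaining bytes are shifted to code points starting at U+0100.
    shift = 0
    for b in range(256):
        if b not in mapping:
            mapping[b] = chr(256 + shift)
            shift += 1
    return mapping


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Convert a displayed vocabulary token back to the text it encodes."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # Partial multi-byte sequences are possible, so replace rather than raise.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥ¢", "ãĥı", "learned"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, `ãĥ¢` and `ãĥı` decode to the katakana characters モ and ハ, while plain ASCII tokens such as `learned` are unchanged.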
Positive Logits
Token       Logit
Learned     1.67
learned     1.62
learnt      1.55
lesson      1.26
lessons     1.22
Lear        1.12
learn       1.12
taught      1.08
glean       1.01
Lessons     0.93
Activations Density 0.078%