INDEX
Explanations
key lessons or insights mentioned in a text
phrases that refer to lessons learned or teachings
New Auto-Interp
Negative Logits
occupancy
-0.74
omin
-0.71
trak
-0.68
umbers
-0.64
FP
-0.64
BLIC
-0.62
chairs
-0.62
hw
-0.62
endars
-0.62
ãĥ¢
-0.61
POSITIVE LOGITS
Learned
1.55
learned
1.50
learnt
1.42
lesson
1.24
lessons
1.23
Lear
1.12
learn
1.10
taught
1.03
glean
0.93
Lessons
0.89
Activations Density 0.051%