INDEX
Explanations
references to mentorship and mentors
New Auto-Interp
Negative Logits
leg
-0.57
<i>
-0.57
ec
-0.54
</i>
-0.54
*>(
-0.54
tec
-0.53
vå
-0.52
中了
-0.51
legs
-0.51
wehr
-0.51
POSITIVE LOGITS
mentors
1.59
mentor
1.57
mentoring
1.45
mentorship
1.41
Mentor
1.36
mentor
1.24
Mentoring
1.24
trainers
1.19
Mentor
1.14
trainer
1.13
Activations Density 0.059%