INDEX
Explanations
references to mentorship roles and relationships
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.06
3:0.05
4:0.13
5:0.02
6:0.03
7:0.36
8:0.03
9:0.03
10:0.11
11:0.09
Negative Logits
fle
-1.55
ヴ
-1.41
shell
-1.38
cation
-1.37
bang
-1.34
isd
-1.34
plet
-1.30
ヴァ
-1.29
preserves
-1.29
buquerque
-1.29
POSITIVE LOGITS
tutor
1.81
Scholar
1.69
younger
1.67
mentors
1.67
youngsters
1.54
fellow
1.52
Teach
1.51
ework
1.51
young
1.48
Spur
1.47
Activations Density 0.001%