INDEX
Explanations
references to teachers and their impact on students
New Auto-Interp
Negative Logits
imen
-0.17
iliar
-0.16
regional
-0.15
executive
-0.15
orsch
-0.14
apprentices
-0.14
Executive
-0.14
zer
-0.14
managed
-0.14
ää
-0.13
POSITIVE LOGITS
teaching
0.29
Teaching
0.25
teacher
0.23
teach
0.23
ped
0.22
Teacher
0.22
æķĻåѦ
0.21
æİĪ
0.20
пÑĢеп
0.20
æķĻ
0.20
Activations Density 0.226%