INDEX
Explanations
references to teachers and educators
references to teachers
New Auto-Interp
Negative Logits
rils
-0.88
tera
-0.74
ril
-0.74
axy
-0.65
hawks
-0.65
rencies
-0.64
launchers
-0.63
00200000
-0.62
obin
-0.62
ulence
-0.62
POSITIVE LOGITS
girls
0.92
Teachers
0.88
Teacher
0.85
heet
0.82
teacher
0.81
children
0.78
achers
0.75
student
0.75
girl
0.75
teachers
0.74
Activations Density 0.028%