Explanations
references to schools, education, or school-related experiences
Negative Logits
school  -1.15
school  -0.92
School  -0.86
School  -0.82
SCHOOL  -0.82
école   -0.71
chool   -0.70
scuola  -0.69
SCHOOL  -0.69
skolan  -0.66
Positive Logits
Italijanski      0.57
Administrativna  0.54
Hentet           0.52
antaine          0.50
الاطلاع          0.49
хьтан            0.49
nesc             0.48
сылкі            0.47
                 0.47
оригіналу        0.47
Activations Density 0.014%