INDEX
Explanations
names and affiliations related to educational institutions
New Auto-Interp
Negative Logits
orneys
-0.63
informée
-0.52
momix
-0.52
dziew
-0.51
ilets
-0.51
sellor
-0.51
nakalista
-0.50
́ng
-0.49
âne
-0.48
IFICATIONS
-0.48
POSITIVE LOGITS
University
1.06
university
0.98
UNIVERSITY
0.97
University
0.92
College
0.86
universities
0.79
College
0.78
college
0.77
university
0.76
université
0.74
Activations Density 0.218%