INDEX
Explanations
references to roles in educational or health contexts
Negative Logits
İ          -0.16
aad        -0.15
Erg        -0.15
oth        -0.14
ола        -0.14
urities    -0.14
لان        -0.14
ottle      -0.13
fault      -0.13
ottes      -0.13
Positive Logits
alike      0.49
/lic       0.15
обрет      0.15
ocomplete  0.15
agem       0.15
ehr        0.14
uba        0.14
anel       0.14
ibold      0.14
381        0.14
Activations Density 0.111%