INDEX
Explanations
names and credentials of individuals, particularly in professional contexts
New Auto-Interp
Negative Logits
ollah
-0.15
rone
-0.14
rapped
-0.14
uran
-0.14
avis
-0.14
blick
-0.14
ouz
-0.13
liž
-0.13
097
-0.13
omb
-0.13
POSITIVE LOGITS
graduate
0.40
gradu
0.40
grad
0.39
graduated
0.39
grad
0.39
earned
0.37
Grad
0.37
earned
0.35
obtained
0.35
earning
0.35
Activations Density 0.262%