INDEX
Explanations
the term "mater" or variations of it
references to academic institutions or affiliations
Negative Logits
ancies      -0.86
ted         -0.80
rons        -0.73
arte        -0.65
APE         -0.65
ards        -0.65
short       -0.64
Asheville   -0.62
tyard       -0.61
manager     -0.61
Positive Logits
estro       1.07
eus         1.06
ñ           0.99
ples        0.96
pl          0.95
ña          0.92
esthesia    0.89
ppa         0.89
qua         0.87
pling       0.87
Activations Density: 0.029%