INDEX
Explanations
mentions of college-related terms
mentions of "college."
Negative Logits
Token               Logit
++++++++++++++++    -0.67
Lumpur              -0.65
itsch               -0.63
ktop                -0.63
alez                -0.62
recomb              -0.62
///                 -0.61
cream               -0.60
TRY                 -0.59
aws                 -0.58
Positive Logits
Token               Logit
uates               1.12
essor               0.94
tuition             0.94
campuses            0.91
graduates           0.87
campus              0.85
graduation          0.83
curric              0.82
basketball          0.80
professors          0.80
Activations Density: 0.018%