INDEX
Explanations
references to specific education or training programs
references to educational or instructional categories
Negative Logits
Leaks    -0.80
agate    -0.75
erry     -0.70
lez      -0.66
dor      -0.64
redo     -0.64
Flip     -0.62
oil      -0.61
pire     -0.61
jer      -0.61
Positive Logits
classes     3.96
Classes     3.33
classes     2.93
class       2.59
Class       2.10
class       1.98
Class       1.96
courses     1.78
CLASS       1.75
subclass    1.62
Activations Density: 0.015%