INDEX
Explanations
words associated with academic programs and coursework
New Auto-Interp
Negative Logits
ãĤ¿ãĥ¼
-0.16
essor
-0.16
onds
-0.15
nett
-0.15
icies
-0.15
оÑĢоÑĤ
-0.14
destin
-0.14
ñas
-0.14
ined
-0.14
kowski
-0.14
POSITIVE LOGITS
courses
0.47
Courses
0.36
course
0.35
courses
0.35
credits
0.35
classes
0.32
core
0.31
Courses
0.30
course
0.29
Credits
0.28
Activations Density 0.059%