INDEX
Explanations
the word "course" used in various contexts
references to educational courses or courses of action
New Auto-Interp
Negative Logits
alty
-0.81
nesday
-0.71
mented
-0.71
gie
-0.64
uggets
-0.62
ry
-0.62
nect
-0.61
erd
-0.60
spat
-0.59
arijuana
-0.58
POSITIVE LOGITS
course
0.99
ourses
0.94
Course
0.89
books
0.88
Course
0.84
work
0.81
washer
0.81
instructors
0.78
courses
0.77
path
0.76
Activations Density 0.018%