INDEX
Explanations
phrases that begin with "Of" or "of," indicating a focus on ownership or references to specific perspectives
New Auto-Interp
Negative Logits
numerator
-0.16
acked
-0.15
ingt
-0.15
umerator
-0.15
wick
-0.15
icking
-0.14
edn
-0.14
ismatch
-0.14
apons
-0.14
adf
-0.14
POSITIVE LOGITS
course
0.43
course
0.32
Course
0.29
Course
0.27
COUR
0.26
.course
0.25
_course
0.24
-course
0.24
OURSE
0.22
entimes
0.22
Activations Density 0.048%