Explanations
phrases related to teaching experience and character development
Negative Logits
gem     -0.16
899     -0.15
umont   -0.15
athing  -0.15
賞      -0.14
HX      -0.14
694     -0.14
urch    -0.14
@brief  -0.14
ke      -0.13
Positive Logits
老     0.23
older  0.23
aged   0.22
old    0.21
age    0.20
older  0.19
-aged  0.19
旧     0.19
-old   0.19
阅     0.19
Activations Density 0.328%