INDEX
Explanations
phrases related to teaching and education
New Auto-Interp
Negative Logits
leigh
-0.17
-hole
-0.16
hole
-0.16
hole
-0.15
omo
-0.15
/down
-0.15
orf
-0.15
atta
-0.14
zung
-0.14
uck
-0.14
POSITIVE LOGITS
ings
0.17
quila
0.16
lint
0.16
ilim
0.16
éŀ
0.15
ToProps
0.15
/Instruction
0.15
preneur
0.15
æĿIJ
0.15
á»ĭnh
0.15
Activations Density 0.043%