INDEX
Explanations
references to self-help and personal development resources
New Auto-Interp
Negative Logits
acad
-0.16
Transcript
-0.15
707
-0.14
anst
-0.14
alo
-0.14
209
-0.14
bottle
-0.13
APS
-0.13
Bottle
-0.13
-0.13
POSITIVE LOGITS
books
0.23
manuals
0.22
zet
0.19
books
0.19
Manuals
0.17
ç±į
0.17
Books
0.16
книги
0.16
-books
0.16
cook
0.15
Activations Density 0.113%