INDEX
Explanations
references to daily activities and routines
New Auto-Interp
Negative Logits
abh
-0.17
abwe
-0.16
gest
-0.16
canf
-0.15
atever
-0.15
Gest
-0.15
cgi
-0.15
encil
-0.15
оÑĢаз
-0.15
bookmark
-0.14
POSITIVE LOGITS
morning
0.21
Morning
0.18
mor
0.18
Breakfast
0.17
breakfast
0.17
mornings
0.17
Wake
0.17
Morning
0.16
Ïĥκ
0.16
ìķĦ침
0.16
Activations Density 0.159%