Explanations
names related to the name "Jan" or "Janet"
dates, specifically those in January
Negative Logits
differences   -0.64
ple           -0.61
viewing       -0.60
experience    -0.60
implied       -0.59
consumption   -0.59
absorb        -0.58
Ult           -0.58
cre           -0.58
handheld      -0.58
Positive Logits
jan   4.40
ja    1.60
Jan   1.42
jun   1.27
jon   1.26
je    1.24
ija   1.23
jen   1.20
jas   1.19
j     1.19
Activations Density: 0.005%