INDEX
Explanations
dates in a specific format
dates mentioned within the document
New Auto-Interp
Negative Logits
ACTED
-0.76
milo
-0.76
ropolitan
-0.75
rower
-0.73
Reviewer
-0.70
diaper
-0.70
ROR
-0.69
akings
-0.68
diapers
-0.67
thirds
-0.64
POSITIVE LOGITS
Jan
1.03
itors
1.03
Jan
1.01
uary
1.01
vier
0.97
Feb
0.95
itor
0.94
Feb
0.87
Nov
0.87
uine
0.81
Activations Density 0.007%