Explanations
dates written in a specific format (month day, year) accompanied by an author's name or title
instances of dates and their formatting in texts
Negative Logits
tram     -0.76
appoint  -0.76
confir   -0.76
sculpt   -0.75
tack     -0.70
funnel   -0.70
eleph    -0.69
estab    -0.69
scrap    -0.68
elect    -0.68
Positive Logits
Introduction  1.89
Enlarge       1.69
Overview      1.58
Posted        1.56
Welcome       1.55
toggle        1.54
Dear          1.54
Hello         1.50
Guest         1.49
Published     1.49
Activations Density: 0.286%