INDEX
Explanations
dates, specifically in the form of "Month Day"
occurrences of the word "January" followed by numerical dates
New Auto-Interp
Negative Logits
orcs
-0.65
aho
-0.64
glances
-0.62
ography
-0.61
ific
-0.60
ologies
-0.59
absorbs
-0.59
lov
-0.59
totem
-0.58
ileged
-0.57
POSITIVE LOGITS
January
3.21
February
2.75
January
2.66
December
2.60
November
2.52
October
2.41
July
2.40
September
2.34
March
2.34
April
2.29
Activations Density 0.015%