INDEX
Explanations
dates specified with the day of the week for public figures
occurrences of the word "on" in various contexts
New Auto-Interp
Negative Logits
ENCY
-0.66
rador
-0.65
hetically
-0.65
APH
-0.60
oother
-0.60
ACTED
-0.60
RIC
-0.59
posed
-0.58
auga
-0.58
acter
-0.58
POSITIVE LOGITS
Tue
1.05
Feb
1.05
Apr
1.03
Aug
0.99
Thu
0.96
Jul
0.95
behalf
0.94
Nov
0.94
Sep
0.92
March
0.90
Activations Density 0.037%