INDEX
Explanations
dates formatted as month and day
Negative Logits
screwed      -0.68
append       -0.65
suite        -0.63
Syndicate    -0.60
nces         -0.60
inals        -0.59
letes        -0.57
untold       -0.57
amines       -0.55
sucks        -0.55
Positive Logits
eve          0.93
occasions    0.88
eteenth      0.84
flower       0.84
heels        0.80
occasion     0.73
uality       0.70
Aug          0.69
evening      0.68
GUI          0.67
Activations Density: 0.056%