Explanations
dates or events identified as "first"
occurrences of the word "first" in various contexts
Negative Logits
tics        -0.85
ractions    -0.69
olia        -0.67
lang        -0.65
md          -0.65
iety        -0.64
gery        -0.64
Nadu        -0.63
ingen       -0.62
vor         -0.62
Positive Logits
responders    1.07
baseman       1.05
batch         1.00
installment   0.94
glimpse       0.90
lady          0.89
ever          0.89
foray         0.87
iteration     0.87
incarnation   0.84
Activations Density: 0.105%