INDEX
Explanations
occurrences of the word "first" or phrases related to being the first of its kind or in a series
instances of something being labeled as "the first" in various contexts
New Auto-Interp
Negative Logits
riber
-0.72
apters
-0.72
ractor
-0.64
iolet
-0.63
urities
-0.62
asions
-0.60
rs
-0.60
ourning
-0.60
lov
-0.60
iane
-0.60
POSITIVE LOGITS
foray
1.01
indication
0.93
ever
0.93
casualty
0.92
step
0.90
tangible
0.89
thing
0.88
installment
0.85
major
0.84
truly
0.83
Activations Density 0.075%