Explanations
instances of phrases referring to being the top or best in a list or ranking
references to "number one" or top rankings in various contexts
Negative Logits
ages        -0.80
Balt        -0.69
Spaces      -0.68
onics       -0.68
antic       -0.66
Ages        -0.66
apers       -0.65
uras        -0.64
areth       -0.64
Distance    -0.63
Positive Logits
priority    0.74
contender   0.73
eree        0.72
offender    0.69
loader      0.69
trending    0.68
ecause      0.68
hitter      0.68
maker       0.67
spot        0.67
Activations Density 0.031%