INDEX
Explanations
phrases that include superlatives indicating something being the most popular, valuable, fascinating, etc., in comparison to others
phrases or concepts associated with being among the best or top entities in various categories
New Auto-Interp
Negative Logits
iture
-0.99
thereof
-0.73
nance
-0.67
Zup
-0.65
ESSION
-0.65
FS
-0.64
ENCE
-0.63
Rex
-0.63
anse
-0.62
eret
-0.62
POSITIVE LOGITS
earliest
1.22
oldest
1.16
hottest
1.09
coolest
1.08
strongest
1.08
busiest
1.07
simplest
1.03
quir
1.02
largest
1.02
fastest
1.02
Activations Density 0.082%