INDEX
Explanations
superlatives and rankings related to different entities
phrases indicating superlative attributes or rankings in the world
New Auto-Interp
Negative Logits
ints
-0.74
xual
-0.72
adel
-0.69
edin
-0.69
onomy
-0.68
ULL
-0.68
irement
-0.67
²¾
-0.66
IRED
-0.65
ikawa
-0.64
POSITIVE LOGITS
largest
1.24
tallest
1.16
smallest
1.05
foremost
1.05
richest
1.04
busiest
1.03
fastest
1.02
longest
1.00
oldest
1.00
hottest
1.00
Activations Density 0.087%