Explanations
superlatives and comparatives
phrases that highlight the significance or excellence of something
Negative Logits
Token       Logit
ooter       -0.82
ategory     -0.79
CLE         -0.74
0           -0.73
epad        -0.70
onet        -0.70
livion      -0.69
ESSION      -0.67
ée          -0.66
akening     -0.66
Positive Logits
Token       Logit
brightest   1.08
smartest    1.07
hars        1.06
earliest    1.05
finest      1.05
toughest    1.03
heaviest    1.02
criticisms  1.01
finer       1.01
strongest   0.96
Activations Density: 0.130%