INDEX
Explanations
the superlatives, particularly the word "Best."
the word "Best" in various contexts
Negative Logits
Token        Logit
URI          -0.68
nir          -0.66
balloon      -0.62
colon        -0.62
prolifer     -0.62
sectional    -0.61
iron         -0.61
flies        -0.61
verse        -0.60
wolf         -0.60
Positive Logits
Token        Logit
Best         3.48
Best         2.67
BEST         2.15
best         2.15
Worst        2.02
best         1.90
Favorite     1.46
Greatest     1.39
worst        1.32
Better       1.28
Activations Density 0.008%
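
One way to read the density figure is as the fraction of token positions on which the feature has a nonzero activation. That reading is an assumption, as the page does not define the term. The sketch below computes it for a hypothetical list of per-token activations; the function name `activation_density`, the threshold, and the sample data are all illustrative and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 8 of 100,000 token positions.
acts = [0.0] * 100_000
for i in range(8):
    acts[i * 1000] = 1.5  # arbitrary positive activation value

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.008%
```

Under this reading, a density of 0.008% corresponds to the feature firing on roughly 8 tokens per 100,000, which fits a feature tied to a narrow set of words such as "Best".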