INDEX
Explanations
comparative adjectives
the word "even" in various contexts emphasizing comparison or degree
Negative Logits
units       -0.78
mson        -0.77
ATURES      -0.76
artments    -0.76
utics       -0.74
apons       -0.74
unia        -0.72
Libraries   -0.71
hops        -0.70
SPORTS      -0.68
Positive Logits
spoiler         0.80
underdog        0.75
explanation     0.74
approximation   0.73
hitter          0.72
temper          0.70
sequel          0.69
excerpt         0.69
variant         0.69
uphill          0.69
Activations Density 0.114%