INDEX
Explanations
adjectives related to degree or comparison
comparative phrases that highlight varying degrees of significance or quality
Negative Logits
stead       -0.67
ノ          -0.59
SON         -0.58
ools        -0.58
ドラゴン    -0.58
Ws          -0.56
POR         -0.56
UCT         -0.54
inders      -0.53
MIN         -0.52
Positive Logits
athlet       0.88
anymore      0.86
idious       0.84
nor          0.71
as           0.70
amus         0.70
istic        0.70
defensively  0.69
than         0.68
itatively    0.68
Activations Density 0.114%