INDEX
Explanations
words related to comparisons where one thing is significantly greater in quantity or quality than another
terms related to comparison and dominance in contexts of measurement and evaluation
Negative Logits
quet         -0.61
Mystery      -0.60
pretended    -0.58
strange      -0.57
Cast         -0.57
curiously    -0.57
Mysterious   -0.56
Factory      -0.55
secret       -0.55
pretend      -0.54
Positive Logits
outweigh     4.15
outwe        3.48
outnumbered  1.53
exceeds      1.38
overwhel     1.37
outper       1.33
outp         1.22
justifies    1.14
exceed       1.13
overshadow   1.09
Activations Density 0.021%