Explanations
phrases that emphasize the importance of context or specificity in various situations
Negative Logits
Token     Logit
heavy     -0.16
gili      -0.15
klady     -0.15
ILLE      -0.14
rarity    -0.14
νÏī       -0.14
Dense     -0.14
dense     -0.13
rowse     -0.13
ãĥĬãĥ¼    -0.13
Positive Logits
Token     Logit
ways      0.50
ways      0.34
Ways      0.32
manners   0.31
way       0.30
away      0.28
novel     0.25
creative  0.25
eways     0.25
away      0.24
Activations Density: 0.098%