INDEX
Explanations
hyphenated words and phrases
phrases indicating negation or absence
New Auto-Interp
Negative Logits
Ake
-0.71
Seeking
-0.69
Pok
-0.68
fade
-0.68
harshly
-0.68
moder
-0.68
Tik
-0.65
Levi
-0.65
geared
-0.65
cultiv
-0.65
POSITIVE LOGITS
same
1.27
middle
1.21
mom
1.20
money
1.16
grain
1.14
scenes
1.14
prem
1.14
distance
1.14
surface
1.12
ground
1.11
Activations Density 0.020%