INDEX
Explanations
expressions emphasizing connection and communication
New Auto-Interp
Negative Logits
cision
-0.15
exo
-0.15
ijd
-0.14
aque
-0.14
uj
-0.13
idea
-0.13
rench
-0.13
istros
-0.13
oub
-0.13
lier
-0.13
POSITIVE LOGITS
recipe
0.37
ingredient
0.34
ingredients
0.34
secret
0.30
keys
0.30
Ingredients
0.30
formula
0.29
Ingredients
0.29
Recipe
0.29
Ingredient
0.28
Activations Density 0.207%