Explanations
mentions of salads
references to salad
Negative Logits
eff          -0.75
ITNESS       -0.74
Apostle      -0.68
FB           -0.67
drawn        -0.66
SPONSORED    -0.64
built        -0.64
abilities    -0.63
doing        -0.63
gone         -0.63
Positive Logits
salad        1.16
salads       1.15
Salad        0.97
greens       0.97
weed         0.95
dressing     0.95
tomatoes     0.89
bowl         0.88
lettuce      0.86
cabbage      0.84
Activations Density: 0.007%