INDEX
Explanations
mentions of salads
references to various types of salads
New Auto-Interp
Negative Logits
redit
-0.77
Borders
-0.76
venants
-0.71
ITNESS
-0.66
fram
-0.63
abusive
-0.63
iazep
-0.62
auer
-0.61
closed
-0.61
accountability
-0.61
POSITIVE LOGITS
greens
1.00
salad
0.95
dressing
0.95
eria
0.87
weed
0.86
bowl
0.85
flav
0.80
salads
0.79
ieri
0.79
Salad
0.78
Activations Density 0.012%