INDEX
Explanations
mentions of burgers
references to burgers
Negative Logits
Token      Logit
inel       -0.79
agall      -0.77
agnetic    -0.72
uchin      -0.70
nces       -0.70
scrib      -0.69
perse      -0.67
Emin       -0.67
oral       -0.66
asio       -0.65
Positive Logits
Token      Logit
burger     0.93
burgers    0.90
urger      0.89
steak      0.85
balls      0.83
hamb       0.81
ards       0.81
becue      0.81
joints     0.81
cake       0.78
Activations Density: 0.038%