INDEX
Explanations
references to burgers
references to burgers
New Auto-Interp
Negative Logits
agnetic
-0.84
oral
-0.76
inel
-0.75
Emin
-0.69
agall
-0.69
uchin
-0.68
Atmospheric
-0.68
nces
-0.67
eus
-0.66
Soros
-0.66
POSITIVE LOGITS
burger
0.87
burgers
0.86
emoji
0.85
meat
0.83
urger
0.82
steak
0.81
hamb
0.81
bowl
0.80
becue
0.79
balls
0.78
Activations Density 0.040%