INDEX
Explanations
instances of the word "cup" and its variations
Negative Logits
Token         Logit
eering        -0.76
Giuliani      -0.71
Voc           -0.67
partName      -0.63
ccording      -0.63
IGHTS         -0.62
gearing       -0.62
reluct        -0.62
generational  -0.61
SPONSORED     -0.61
Positive Logits
Token         Logit
cake          1.94
cakes         1.86
cup           1.14
board         1.09
boards        1.05
idity         0.99
illo          0.91
ener          0.90
cups          0.90
atu           0.89
Activations Density: 0.008%