INDEX
Explanations
cookie-related terms and actions
mentions of cookies, particularly in various contexts and discussions
Negative Logits
Token       Logit
SI          -0.74
WAYS        -0.73
ashtra      -0.72
abouts      -0.69
rior        -0.66
ional       -0.65
involved    -0.64
WARD        -0.63
Tsarnaev    -0.63
ities       -0.62
Positive Logits
Token       Logit
cookies     1.27
dough       1.17
cookie      1.07
jar         1.06
Cookies     0.98
jars        0.98
Clicker     0.98
Cookie      0.95
cutter      0.94
cookie      0.90
Activations Density: 0.018%