INDEX
Explanations
mentions of "cool" and its variations in the context of various topics
New Auto-Interp
Negative Logits
Madness
-0.15
bes
-0.14
sill
-0.14
ples
-0.14
larg
-0.14
rones
-0.14
hire
-0.14
azar
-0.14
coni
-0.14
verity
-0.13
POSITIVE LOGITS
cool
0.20
assin
0.18
Cool
0.18
icular
0.18
Cool
0.17
iness
0.17
stuff
0.17
coolest
0.16
cool
0.16
ãĤ¡
0.16
Activations Density 0.077%