INDEX
Explanations
mentions of the word "cool" and its variations
Negative Logits
ynchronously  -0.20
eer           -0.18
idal          -0.18
ecure         -0.17
atically      -0.16
ecurity       -0.16
\API          -0.16
etc           -0.16
eter          -0.16
lene          -0.16
Positive Logits
idge          0.32
ness          0.27
ers           0.26
Britann       0.23
headed        0.23
ibri          0.23
IDGE          0.23
breeze        0.23
ies           0.23
io            0.23
Activations Density: 0.017%