INDEX
Explanations
phrases related to explanations or explorations
instances of the word "explore" or its variations that indicate an examination or investigation of topics
New Auto-Interp
Negative Logits
chal
-0.63
Amid
-0.63
faces
-0.61
yards
-0.60
petitions
-0.59
stand
-0.59
DAY
-0.58
thumbs
-0.58
tune
-0.56
pledge
-0.56
POSITIVE LOGITS
oit
1.56
orers
1.54
oded
1.40
oding
1.39
osion
1.38
icit
1.36
orer
1.35
oration
1.32
ained
1.31
aining
1.25
Activations Density 0.028%