INDEX
Explanations
questions starting with "How does" and "What does"
questions that begin with "how does" or related phrases
New Auto-Interp
Negative Logits
fights
-0.79
ascript
-0.76
devices
-0.76
isphere
-0.74
boats
-0.72
runners
-0.72
cies
-0.70
tracks
-0.70
ishly
-0.70
sers
-0.70
POSITIVE LOGITS
anyone
0.86
anybody
0.86
olation
0.83
olated
0.72
one
0.68
omorphic
0.67
olate
0.67
it
0.67
this
0.67
onga
0.63
Activations Density 0.051%