INDEX
Explanations
questions starting with "What does" in the context of seeking meaning or implications
questions and references to meaning or implications
Negative Logits
Token             Logit
geoning           -0.71
TPPStreamerBot    -0.71
imov              -0.70
oys               -0.63
sung              -0.63
noticed           -0.62
oved              -0.62
geon              -0.62
abad              -0.62
mort              -0.61
Positive Logits
Token             Logit
entail            1.38
entails           1.33
mean              1.25
Means             1.12
means             1.07
meant             1.05
mean              1.00
signify           1.00
signifies         0.93
boils             0.90
Activations Density 0.129%
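
As a rough illustration of the density figure above, the sketch below assumes that "activations density" means the fraction of sampled token positions on which the feature has a nonzero activation. That definition, the function name `activation_density`, and the sample data are assumptions made for illustration; they are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumed definition: density = (# positions with activation > threshold) / (# positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 129 of which fire.
acts = [0.0] * 100_000
for i in range(129):
    acts[i * 700] = 1.5  # arbitrary positive activation value

density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.129% corresponds to the feature firing on roughly 129 of every 100,000 tokens, which is consistent with a sparse feature tied to a narrow set of "meaning" verbs.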