INDEX
Explanations
questions or inquiries ending with a question mark
questions, particularly rhetorical or exclamatory ones
Negative Logits
practition     -0.79
avorite        -0.78
srf            -0.78
uckland        -0.75
carbohyd       -0.74
corrid         -0.73
satell         -0.72
ailability     -0.71
antidepress    -0.70
rament         -0.70
Positive Logits
Nope           1.46
Answer         1.45
Surely         1.41
Wouldn         1.38
Isn            1.38
Well           1.33
Why            1.31
Answer         1.26
Because        1.25
Didn           1.25
Activations Density: 0.101%