Explanations
questions and inquiries
questions starting with the word "What."
Negative Logits
favour      -0.76
reserve     -0.74
favor       -0.74
arch        -0.71
close       -0.67
receiving   -0.64
firm        -0.64
ranking     -0.64
res         -0.63
due         -0.63
Positive Logits
What        2.81
Why         2.21
How         2.11
Who         2.08
WHAT        2.05
what        2.04
Where       1.86
Which       1.81
What        1.77
Whatever    1.70
Activations Density: 0.022%