INDEX
Explanations
questions that begin with the word "Why."
New Auto-Interp
Negative Logits
presented
-0.58
Hauptartikel
-0.56
tend
-0.56
erl
-0.55
")[
-0.54
XXXXXXXX
-0.50
Barker
-0.50
relative
-0.50
Depend
-0.50
depend
-0.49
POSITIVE LOGITS
why
3.50
why
3.30
Why
3.10
Why
3.07
WHY
2.96
WHY
2.94
pourquoi
2.62
Waarom
2.56
Warum
2.48
waarom
2.40
Activations Density 0.059%