Explanations
questions beginning with "Why"
Negative Logits
Token        Logit
rovnik       -0.55
externes     -0.54
következő    -0.49
Personal     -0.47
Personal     -0.47
دت           -0.47
personal     -0.47
House        -0.46
House        -0.46
有所         -0.46
Positive Logits
Token        Logit
Why          1.27
Why          1.26
WHY          1.25
why          1.23
Waarom       1.18
WHY          1.16
Warum        1.10
why          1.08
为什么要     1.04
Pourquoi     1.03
Activations Density: 0.070%