INDEX
Explanations
interrogative phrases that seek clarification or information
New Auto-Interp
Negative Logits
PreInfinity
-0.71
ؤلاء
-0.71
니다
-0.67
صوتيه
-0.67
äten
-0.63
הזה
-0.63
समीक्षक
-0.63
iestety
-0.61
енча
-0.61
expandindo
-0.60
POSITIVE LOGITS
What
1.04
What
1.04
How
0.98
How
0.98
what
0.88
WHAT
0.84
WHAT
0.82
how
0.81
Why
0.81
Why
0.80
Activations Density 0.185%