INDEX
Explanations
the occurrence of personal pronouns like "you," "we," and "I."
how explaining or questioning
New Auto-Interp
Negative Logits
ब्रेकडाउन
-0.48
tentu
-0.48
obviously
-0.46
Obviously
-0.43
obviously
-0.42
Obviously
-0.42
fizer
-0.41
⧠
-0.40
évidemment
-0.40
avantage
-0.40
POSITIVE LOGITS
cómo
0.82
Cómo
0.79
How
0.71
how
0.71
如何
0.69
Cómo
0.68
如何
0.67
How
0.66
Hvordan
0.66
howto
0.65
Activations Density 0.010%