INDEX
Explanations
references to personal pronouns and the word "it."
Negative Logits
Token         Logit
للمعارف       -1.51
myſelf        -1.34
tvguidetime   -1.31
Efq           -1.29
httphttps     -1.25
cauſe         -1.24
صوتيه         -1.24
itſelf        -1.23
houſe         -1.22
تانيه         -1.22
Positive Logits
Token         Logit
.             0.79
↵↵            0.78
is            0.69
I             0.68
<eos>         0.68
(no token shown)  0.68
,             0.67
(             0.66
↵             0.63
In            0.62
Activations Density 0.710%