INDEX
Explanations
the word "do" in various forms and contexts
New Auto-Interp
Negative Logits
Efq
-1.19
nahilalakip
-1.16
&___
-1.06
WriteBarrier
-1.02
pleaſure
-0.99
ſelf
-0.97
CloseOperation
-0.96
مرئيه
-0.94
GEBURTSDATUM
-0.93
مشين
-0.93
POSITIVE LOGITS
0.63
I
0.62
done
0.62
form
0.57
In
0.54
is
0.53
in
0.52
,
0.52
`
0.52
so
0.51
Activations Density 0.129%