INDEX
Explanations
the presence of the names "Jean" and "Jane."
New Auto-Interp
Negative Logits
RenderAtEndOf
-1.02
myſelf
-1.01
itſelf
-1.01
Monfieur
-0.92
nahilalakip
-0.91
Eſ
-0.89
يتيمه
-0.89
Efq
-0.89
曖昧さ回避
-0.88
AndEndTag
-0.88
POSITIVE LOGITS
Jane
1.87
Jean
1.65
Jane
1.64
Jean
1.43
Jan
1.42
Jan
1.39
jan
1.35
JAN
1.27
JEAN
1.27
jane
1.26
Activations Density 0.051%