Explanations
the exact word "Den" (capital D) in the text.
Negative Logits
myſelf        -1.89
itſelf        -1.80
Efq           -1.79
Monfieur      -1.70
Theſe         -1.66
themſelves    -1.65
pleaſure      -1.65
himſelf       -1.61
Anſ           -1.60
Houſe         -1.57
Positive Logits
de            1.00
del           0.90
en            0.85
d             0.81
              0.77
und           0.76
di            0.75
par           0.74
des           0.74
du            0.74
Activations Density 0.002%