INDEX
Explanations
occurrences of the name "Winfrey."
Negative Logits
duct     -0.19
ea       -0.18
een      -0.18
ee       -0.18
alted    -0.18
cas      -0.17
foot     -0.17
ed       -0.17
eed      -0.16
ymous    -0.16
Positive Logits
-win     0.29
throp    0.28
ograd    0.26
ning     0.26
chester  0.25
nable    0.24
/win     0.23
ona      0.23
try      0.23
eries    0.23
Activations Density: 0.018%