INDEX
Explanations
references to the word "Duke"
references to the term "Duke."
Negative Logits
Token       Logit
hedon       -0.71
ãģĻ         -0.67
iped        -0.67
cells       -0.67
ulations    -0.66
iations     -0.65
ional       -0.64
served      -0.62
agency      -0.62
ters        -0.61
Positive Logits
Token       Logit
hyde        1.08
halla       0.80
ongyang     0.75
onna        0.74
Duke        0.73
Bride       0.71
ham         0.71
lac         0.71
hole        0.70
isine       0.67
Activations Density 0.021%