INDEX
Explanations
mentions of the word "Duke" with some contextual relation
references to the word "Duke."
New Auto-Interp
Negative Logits
tele
-0.66
ins
-0.64
tal
-0.64
responses
-0.61
Woodward
-0.60
di
-0.60
synd
-0.60
sensit
-0.59
staples
-0.59
forwards
-0.58
POSITIVE LOGITS
uke
4.56
ukes
2.83
uked
2.03
uka
1.52
uk
1.41
ugi
1.24
uki
1.24
ake
1.16
uku
1.16
hiba
1.15
Activations Density 0.007%