INDEX
Explanations
words related to a specific entity, potentially a person named "Hickenlooper"
the name "Hickenlooper."
New Auto-Interp
Negative Logits
nom
-0.65
Mandatory
-0.64
atography
-0.60
rw
-0.59
oret
-0.59
mileage
-0.58
doc
-0.58
breakthrough
-0.58
doc
-0.57
NPR
-0.56
POSITIVE LOGITS
icken
1.25
backer
1.03
ergy
0.98
bats
0.91
cies
0.87
anguage
0.83
fold
0.83
hs
0.82
ilage
0.81
aughs
0.79
Activations Density 0.007%