INDEX
Explanations
proper nouns named "Robert"
the name "Robert" in various contexts
New Auto-Interp
Negative Logits
ILA
-0.75
BOOK
-0.69
POLIT
-0.65
ptive
-0.64
INESS
-0.63
utils
-0.62
cond
-0.61
cipline
-0.59
sylv
-0.59
tack
-0.59
POSITIVE LOGITS
Conquest
0.90
Mueller
0.88
Mug
0.87
Stacy
0.83
Byrd
0.82
Griffin
0.82
Kraft
0.81
Mercer
0.78
McDonnell
0.77
Reich
0.76
Activations Density 0.020%