INDEX
Explanations
titles or names with the word "Robert"
proper nouns related to notable individuals
Negative Logits
Token        Logit
ashtra       -0.82
pour         -0.67
achus        -0.64
ibo          -0.63
Marie        -0.63
uable        -0.62
voy          -0.60
itiveness    -0.59
igans        -0.59
warm         -0.59
Positive Logits
Token        Logit
ħĭ           0.80
ĵĺ           0.75
¶æ           0.69
Curve        0.64
Reich        0.63
²            0.63
Wilhelm      0.63
eston        0.63
Carney       0.62
ĻĤ           0.62
Activations Density: 0.109%