Explanations
names with the title "Sir"
prominent names of individuals, particularly those with honorific titles or significant roles
Negative Logits
CTR      -0.77
erest    -0.77
pus      -0.68
uably    -0.66
*/(      -0.66
sugg     -0.65
veins    -0.64
dylib    -0.62
straw    -0.61
thru     -0.60
Positive Logits
Admir        0.73
itus         0.71
Conan        0.71
Leigh        0.67
dden         0.65
uner         0.64
xual         0.63
Roberts      0.63
Skydragon    0.62
Divinity     0.61
Activations Density: 0.088%