INDEX
Explanations
mentions of notable individuals, particularly those with a strong association with the name "Albert", like "Albert Einstein"
references to the name "Albert," particularly associated with notable figures such as Albert Einstein
New Auto-Interp
Negative Logits
BOOK
-0.86
ãĥ¯
-0.81
ned
-0.78
osuke
-0.78
ning
-0.74
pter
-0.72
efully
-0.71
ners
-0.69
mble
-0.68
ettlement
-0.68
POSITIVE LOGITS
Einstein
1.18
Pu
0.81
Calder
0.76
Schwe
0.75
inas
0.75
onso
0.73
inates
0.72
rand
0.72
Hammond
0.72
anus
0.69
Activations Density 0.022%