INDEX
Explanations
references to individuals named Albert, particularly Albert Einstein
mentions of the name "Albert," particularly in connection with notable figures like Einstein
New Auto-Interp
Negative Logits
BOOK
-0.85
ãĥ¯
-0.84
ned
-0.79
ning
-0.75
elcome
-0.75
osuke
-0.74
pter
-0.73
mble
-0.73
efully
-0.73
ners
-0.72
POSITIVE LOGITS
Einstein
1.10
Pu
0.79
Calder
0.78
Schwe
0.77
Heights
0.71
onso
0.70
Hammond
0.70
rand
0.69
inas
0.69
inates
0.68
Activations Density 0.025%