INDEX
Explanations
references to a specific individual named Albert, likely referencing Albert Einstein
New Auto-Interp
Negative Logits
osuke
-0.86
ned
-0.85
BOOK
-0.78
ning
-0.74
ners
-0.72
ãĥ¯
-0.71
efully
-0.70
glers
-0.70
pter
-0.70
packing
-0.69
POSITIVE LOGITS
Einstein
1.13
Pu
0.81
Schwe
0.81
rand
0.80
onso
0.79
inas
0.79
Calder
0.76
Wenger
0.71
inite
0.69
Hammond
0.69
Activations Density 0.024%