INDEX
Explanations
proper names, specifically "Henry"
references to the name "Henry."
New Auto-Interp
Negative Logits
discriminated
-0.79
âĶ
-0.71
bases
-0.69
polar
-0.68
utt
-0.67
nuts
-0.66
plane
-0.64
prevail
-0.64
nets
-0.62
Spac
-0.62
POSITIVE LOGITS
Henry
3.46
Henry
2.95
Edward
1.51
Kissinger
1.38
Richard
1.37
Henri
1.34
Wilhelm
1.29
Charles
1.27
Arthur
1.27
Philip
1.26
Activations Density 0.019%