INDEX
Explanations
proper nouns, specifically names such as "Howard"
mentions of the name "Howard."
New Auto-Interp
Negative Logits
asus
-0.76
yrim
-0.76
terday
-0.69
ccording
-0.68
erala
-0.68
ktop
-0.66
BOOK
-0.66
CVE
-0.65
tera
-0.65
ebin
-0.64
POSITIVE LOGITS
Stern
1.09
Howard
1.04
Dean
0.87
Ellis
0.86
Schultz
0.85
Webb
0.81
Howard
0.80
Neville
0.79
Hughes
0.78
Buffett
0.77
Activations Density 0.007%