INDEX
Explanations
phrases related to a specific person named Elizabeth Warren
occurrences of the name "Elizabeth."
New Auto-Interp
Negative Logits
ursed
-0.86
acca
-0.75
yright
-0.73
²¾
-0.72
unal
-0.71
alien
-0.69
awaru
-0.68
ãĥ£
-0.67
ebin
-0.66
haps
-0.66
POSITIVE LOGITS
Elizabeth
1.17
Warren
1.10
Elizabeth
0.96
Howell
0.95
Jennings
0.90
Liu
0.84
Holmes
0.83
Hath
0.82
Dodd
0.80
tis
0.80
Activations Density 0.004%