INDEX
Explanations
mentions of the name "Jill"
mentions of the name "Jill."
New Auto-Interp
Negative Logits
constitu
-0.92
tremend
-0.83
jaws
-0.77
obser
-0.76
ingred
-0.75
mble
-0.71
Þ
-0.70
deaf
-0.70
captcha
-0.69
ĻĤ
-0.69
POSITIVE LOGITS
Stein
1.07
ian
0.99
ians
0.99
Filip
0.98
Abram
0.88
alo
0.85
Jill
0.85
McCabe
0.80
athon
0.75
ante
0.75
Activations Density 0.028%