Explanations
mentions of the name "Jones"
NEGATIVE LOGITS
Token       Logit
ULAR        -0.86
utive       -0.84
uated       -0.84
racted      -0.83
aneously    -0.82
ulatory     -0.76
uate        -0.75
rous        -0.74
aneous      -0.69
ãĤ£         -0.66
POSITIVE LOGITS
Token       Logit
sey          1.09
boro         1.04
hole         0.89
Jones        0.85
Jones        0.84
cloth        0.80
holes        0.79
hawk         0.79
Jr           0.79
minster      0.77
Activations Density 0.030%