INDEX
Explanations
words related to the name "Jones"
the presence of the name "Jones" across various contexts
New Auto-Interp
Negative Logits
RET
-0.74
frag
-0.72
ger
-0.68
pring
-0.67
ãĥ´
-0.67
incent
-0.66
crem
-0.66
english
-0.65
Dota
-0.65
refill
-0.65
POSITIVE LOGITS
Jones
3.81
Jones
3.14
Moore
1.41
Johnston
1.39
Owens
1.34
Perkins
1.30
Thompson
1.29
Johnson
1.29
Hanson
1.29
Hawkins
1.28
Activations Density 0.016%