INDEX
Explanations
references to a person named "Watson" with various characteristics or contexts
references to the name "Watson."
Negative Logits
uate      -0.78
ccording  -0.75
egal      -0.73
ulatory   -0.73
fertile   -0.72
ional     -0.71
eways     -0.70
ĩ         -0.68
uation    -0.68
uated     -0.68
Positive Logits
Watson    1.15
atson     1.11
combe     0.88
Norton    0.83
Holmes    0.83
Whe       0.81
Phill     0.81
tsky      0.78
orld      0.78
yth       0.74
Activations Density: 0.025%