INDEX
Explanations
references to individuals with the title "Mr." followed by names
New Auto-Interp
Negative Logits
."</
-0.80
=()
-0.72
Ẽ
-0.72
ագրություններ
-0.72
])).
-0.72
')")
-0.71
/).
-0.71
″]
-0.70
*/;
-0.69
)')
-0.68
POSITIVE LOGITS
Mr
0.93
Mr
0.90
Sirs
0.88
Mrs
0.81
R
0.76
Dr
0.76
Mrs
0.75
Sir
0.75
Dr
0.73
monsieur
0.72
Activations Density 0.102%