INDEX
Explanations
references to celebrities and their public appearances
New Auto-Interp
Negative Logits
dos
-0.20
deg
-0.19
disproportion
-0.17
dose
-0.17
dos
-0.17
Dip
-0.17
ded
-0.17
460
-0.16
docking
-0.16
Dough
-0.16
POSITIVE LOGITS
Dan
1.13
Daniel
1.12
DAN
1.09
Dan
1.01
dan
0.99
Daniel
0.98
Daniels
0.89
dan
0.84
Danny
0.77
Dani
0.75
Activations Density 0.044%