INDEX
Explanations
historical figures and their contributions to science
New Auto-Interp
Negative Logits
chten
-0.18
igham
-0.15
akeup
-0.15
luet
-0.15
meer
-0.14
dof
-0.14
onia
-0.13
istrat
-0.13
elts
-0.13
idity
-0.13
POSITIVE LOGITS
Cru
0.16
182
0.15
Prix
0.15
186
0.14
treat
0.14
Sir
0.14
189
0.14
Richards
0.14
Ñħодим
0.14
publishing
0.14
Activations Density 0.120%