Explanations
phrases related to definitions or explanations of terms or concepts
definitions or descriptions of various concepts and terms
Negative Logits
Universities        -0.67
faiths              -0.63
directions          -0.63
glaciers            -0.60
selves              -0.60
Notting             -0.59
redes               -0.59
universities        -0.59
adjustments         -0.59
shores              -0.59
Positive Logits
ALWAYS              1.06
usually             0.93
typically           0.92
preferable          0.87
always              0.86
typically           0.86
agine               0.85
someone             0.84
indistinguishable   0.84
ordinarily          0.84
Activations Density 0.178%