INDEX
Explanations
definitions or explanations of words and terms
terms related to meanings and translations
New Auto-Interp
Negative Logits
rodu
-0.88
Blair
-0.76
****************
-0.72
iew
-0.71
igators
-0.69
Frazier
-0.67
respons
-0.67
odynamics
-0.67
aughter
-0.65
ardy
-0.65
POSITIVE LOGITS
suffix
0.92
abbre
0.88
initials
0.87
denotes
0.84
Meaning
0.84
equivalent
0.80
α
0.79
insign
0.79
pronounced
0.77
denote
0.76
Activations Density 0.316%