INDEX
Explanations
terms that are used to refer to specific concepts or things
definitions and explanations of terms
New Auto-Interp
Negative Logits
idav
-0.80
asive
-0.77
iar
-0.71
Veter
-0.69
ó
-0.67
JM
-0.67
pour
-0.67
aqu
-0.66
asions
-0.66
©¶æ¥µ
-0.66
POSITIVE LOGITS
bidden
0.80
qualities
0.77
abbre
0.74
initials
0.71
shorthand
0.69
plural
0.69
irregular
0.68
anything
0.68
vow
0.67
atomic
0.66
Activations Density 0.126%