INDEX
Explanations
references to "basic" concepts, principles, or items across various contexts
New Auto-Interp
Negative Logits
ous
-0.16
383
-0.15
esi
-0.15
LE
-0.15
ynes
-0.14
hot
-0.14
descended
-0.14
etc
-0.14
opot
-0.14
specific
-0.14
POSITIVE LOGITS
/basic
0.34
DBObject
0.26
NameValuePair
0.25
mente
0.22
-basic
0.21
premise
0.20
/simple
0.19
ially
0.19
/original
0.19
xes
0.18
Activations Density 0.031%