INDEX
Explanations
mathematical or scientific terms and concepts
Negative Logits
llan       -0.71
riad       -0.68
committee  -0.62
sensing    -0.61
scenes     -0.60
iculty     -0.60
Beaut      -0.60
olean      -0.60
olicy      -0.60
rely       -0.58
Positive Logits
unciation  1.51
unci       1.36
ounced     1.29
ihilation  1.22
ouncing    1.20
ounces     1.18
ational    1.17
atural     1.13
ials       1.10
ounce      1.08
Activations Density 1.097%