INDEX
Explanations
phrases related to things being well-defined or well-described
phrases indicating something that is well-defined or well-regarded
Negative Logits
Midnight    -0.71
Cutter      -0.70
Phi         -0.69
Bravo       -0.69
Crisis      -0.69
Compass     -0.67
Theft       -0.67
Indigo      -0.66
Tags        -0.64
Hancock     -0.64
Positive Logits
known         1.37
established   1.33
defined       1.30
enough        1.28
trained       1.27
earned        1.26
documented    1.25
connected     1.24
respected     1.23
intention     1.23
Activations Density: 0.032%