INDEX
Explanations
phrases or terms that indicate high quality or high value
"high" preceding a characteristic
high followed by specific descriptors
New Auto-Interp
Negative Logits
opsis
-0.48
Cordero
-0.45
oplan
-0.45
amante
-0.43
Beren
-0.43
batter
-0.42
trn
-0.42
makebox
-0.41
anatomy
-0.41
GeneratedCode
-0.40
POSITIVE LOGITS
High
1.08
high
1.08
High
1.02
HIGH
1.01
high
0.96
HIGH
0.89
highest
0.76
Low
0.75
Highest
0.75
Highest
0.74
Activations Density 0.224%