INDEX
Explanations
phrases with the pattern "a or a" followed by a single word
phrases indicating uncertainty or choice
New Auto-Interp
Negative Logits
edit
-0.85
Edit
-0.83
Events
-0.81
words
-0.80
reports
-0.75
orders
-0.72
groups
-0.72
issues
-0.71
warn
-0.71
results
-0.70
POSITIVE LOGITS
rouse
1.03
lot
0.93
handful
0.90
combination
0.90
esthetic
0.89
mere
0.87
bunch
0.84
precursor
0.83
cknowled
0.81
dozen
0.81
Activations Density 0.400%