INDEX
Explanations
phrases describing a process or workflow
statements explaining processes or concepts
New Auto-Interp
Negative Logits
uable
-0.73
prized
-0.62
worthiness
-0.62
contamin
-0.62
risked
-0.61
inventoryQuantity
-0.60
Newsp
-0.60
overcame
-0.60
reon
-0.60
worth
-0.59
POSITIVE LOGITS
follows
1.13
straightforward
1.10
simple
1.02
summarized
1.00
similar
0.95
:[
0.92
identical
0.88
simplest
0.87
chronological
0.87
analogous
0.85
Activations Density 0.319%