INDEX
Explanations
phrases related to positive attributes or accomplishments
phrases indicating emotions and human experiences
Negative Logits
)]                 -0.54
],                 -0.50
Description        -0.50
Opp                -0.49
Mons               -0.48
Originally         -0.48
Dimensions         -0.48
?",                -0.48
inventoryQuantity  -0.48
Burke              -0.48
Positive Logits
accordingly        1.07
thereafter         0.88
thereof            0.75
afterward          0.72
afterwards         0.68
therein            0.68
attRot             0.61
wherever           0.59
thereto            0.58
versa              0.57
Activations Density: 2.365%