INDEX
Explanations
phrases related to time and numbers
punctuation and distinct sentence endings
New Auto-Interp
Negative Logits
invaluable
-0.73
cheesy
-0.69
peace
-0.69
gratitude
-0.67
bounty
-0.66
pill
-0.66
nude
-0.66
arts
-0.65
gown
-0.65
wellness
-0.65
POSITIVE LOGITS
Both
1.38
Either
1.35
Neither
1.19
Differences
1.17
Difference
1.13
Together
1.10
Each
1.06
Again
1.04
Comparison
1.03
Both
1.03
Activations Density 0.769%