INDEX
Explanations
words related to expressing gratitude or making requests
phrases indicating intentions or requests
New Auto-Interp
Negative Logits
ById
-0.76
furt
-0.74
adolesc
-0.67
Stru
-0.67
Hess
-0.64
Worth
-0.64
Nash
-0.64
Abel
-0.61
metadata
-0.60
Five
-0.60
POSITIVE LOGITS
congratulate
1.06
emulate
1.02
encourage
1.01
propose
0.97
clarify
0.95
reassure
0.95
discourage
0.93
assure
0.92
hear
0.92
nominate
0.91
Activations Density 0.049%