INDEX
Explanations
words or phrases related to evaluating the impact of actions or situations
phrases expressing benefits or positives for various subjects or contexts
Negative Logits
aque        -0.77
rette       -0.68
ohn         -0.67
aram        -0.67
venants     -0.67
cles        -0.67
laun        -0.66
rose        -0.65
clone       -0.65
operated    -0.65
Positive Logits
morale      0.93
everybody   0.86
geries      0.83
gotten      0.78
laughs      0.78
us          0.77
storing     0.76
awhile      0.76
everyone    0.76
efficiency  0.75
Activations Density: 0.097%