INDEX
Explanations
expressions of gratitude or acknowledgment
phrases related to expressions of gratitude and repetition of phrases
Negative Logits
ibal      -0.71
Flickr    -0.65
Cruiser   -0.65
edia      -0.61
agos      -0.59
aminer    -0.59
ective    -0.59
Flickr    -0.58
Tycoon    -0.58
ãĥĥãĥĪ    -0.57
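The last negative-logit token, ãĥĥãĥĪ, looks like a byte-level BPE token string rather than encoding damage. The source does not say which tokenizer the model uses, so the following is only a sketch under the assumption of a GPT-2-style byte-to-unicode table; the helper names `bytes_to_unicode` and `decode_token` are illustrative and not part of the dashboard.

```python
def bytes_to_unicode():
    """Build the GPT-2-style table mapping each byte (0-255) to a printable character.

    Assumption: the model behind this dashboard uses this byte-level scheme.
    """
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted into the code points starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: printable character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a byte-level token string back into the text it stands for."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Tokens may end mid-character, so undecodable bytes are replaced, not raised.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    print(decode_token("\u00e3\u0125\u0125\u00e3\u0125\u012a"))
```

Under that assumption the token decodes to the katakana pair ット, and plain ASCII tokens such as aloud pass through unchanged.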
Positive Logits
aloud      1.67
loudly     1.33
goodbye    1.25
loud       1.17
louder     1.09
Goodbye    0.96
farewell   0.96
sarcast    0.93
publicly   0.92
prayers    0.87
Activations Density: 0.170%