INDEX
Explanations
expressions of hope and expectations for positive outcomes
Negative Logits
almost       -0.23
almost       -0.22
Almost       -0.19
Almost       -0.18
neredeyse    -0.18
probably     -0.17
Probably     -0.17
likely       -0.16
恐           -0.16
aler         -0.16
Positive Logits
soon         0.27
somehow      0.24
soon         0.22
algún        0.21
someday      0.20
alespoň      0.19
Soon         0.19
eventual     0.19
-нибудь      0.19
enough       0.18
Activations Density 0.151%