Explanations
phrases that express reflections on expectations and perceptions versus reality
Negative Logits
ulur        -0.16
меть        -0.16
shouldn     -0.15
asta        -0.15
obby        -0.15
wonder      -0.14
мотреть     -0.14
weren       -0.14
ë¥          -0.14
luet        -0.14
Positive Logits
barg        0.28
Barg        0.22
bargain     0.21
initially   0.21
originally  0.21
bargaining  0.20
strictly    0.20
realize     0.20
realise     0.19
otherwise   0.19
Activations Density: 0.069%