Explanations
phrases related to expectations or obligations
Negative Logits
Token           Logit
itſelf          -0.98
myſelf          -0.93
Efq             -0.92
pleaſure        -0.83
kháu            -0.81
Jefus           -0.81
fince           -0.80
BoxDecoration   -0.78
poffe           -0.77
themſelves      -0.77
Positive Logits
Token           Logit
supposed        1.58
supposed        1.30
supposedly      0.99
meant           0.98
suppose         0.92
supuestamente   0.78
meant           0.77
intended        0.73
suppos          0.71
should          0.69
Activations Density 0.124%
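
For readers unfamiliar with the density figure, the sketch below shows one plausible way such a percentage could be computed: the share of tokens on which the feature's activation is above zero. This definition is an assumption and is not confirmed by the page itself; the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumed definition: density = (active tokens / total tokens) * 100.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical toy data: the feature fires on 1 of 8 tokens.
sample = [0.0, 0.0, 0.0, 2.7, 0.0, 0.0, 0.0, 0.0]
print(f"{activation_density(sample):.3f}%")  # prints 12.500%
```

Under this assumed definition, a density of 0.124% would mean the feature is active on roughly one token in every 800.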