Explanations
expressions that involve contemplation or reflection on personal thoughts and societal situations
Negative Logits
Token        Logit
TestFixture  -0.16
utm          -0.16
ilon         -0.15
quist        -0.15
iod          -0.14
ecided       -0.14
alez         -0.13
¸ı           -0.13
zym          -0.13
eba          -0.13
Positive Logits
Token        Logit
buz          0.16
QL           0.15
looked       0.15
اظ           0.14
857          0.14
.strict      0.14
strictly     0.14
closely      0.14
unin         0.14
ä»Ķ          0.14
Activations Density 0.115%