Explanations
phrases that indicate quantities or assessments of items and definitions of standards
Negative Logits
Token          Logit
객             -0.18
strcasecmp     -0.16
ILT            -0.15
ッシュ         -0.15
しか           -0.15
Gab            -0.15
IGHL           -0.14
ありがとう     -0.14
.Assertions    -0.14
празд          -0.14
Positive Logits
Token          Logit
means          0.23
mean           0.23
meaning        0.21
meant          0.21
means          0.20
meaning        0.18
mean           0.18
refer          0.17
.mean          0.17
refers         0.17
Activations Density 0.050%