Explanations
references to time or duration in phrases
Negative Logits
Token       Logit
ifr         -0.14
ingt        -0.14
uprav       -0.14
-hours      -0.14
ìĭ¬         -0.13
inq         -0.13
é£Łåĵģ      -0.13
dém         -0.13
_THIS       -0.13
/details    -0.13
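Two entries in the negative-logit list (`ìĭ¬` and `é£Łåĵģ`) look like byte-level BPE token strings rather than readable text. The sketch below assumes, without confirmation from this page, that the underlying tokenizer uses the GPT-2 style byte-to-unicode table; the helper names `gpt2_byte_decoder` and `decode_token` are hypothetical and exist only for this illustration.

```python
def gpt2_byte_decoder():
    """Build the inverse of the GPT-2 style byte-to-unicode table.

    Assumption: the dashboard's tokenizer uses this table. Bytes that are
    printable Latin-1 characters map to themselves; every other byte is
    shifted to code point 256 + k, in increasing byte order.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    table = {chr(b): b for b in printable}
    shift = 0
    for b in range(256):
        if b not in printable:
            table[chr(256 + shift)] = b
            shift += 1
    return table


def decode_token(token, table=None):
    """Map each display character back to its byte, then decode as UTF-8."""
    table = table or gpt2_byte_decoder()
    raw = bytes(table[ch] for ch in token)
    # Partial multi-byte sequences are possible for single tokens,
    # so undecodable bytes are replaced instead of raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ìĭ¬", "é£Łåĵģ", "-hours"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

If the assumption holds, `é£Łåĵģ` corresponds to 食品 and `ìĭ¬` to 심, while plain ASCII tokens such as `-hours` pass through unchanged. With a different tokenizer the mapping would not apply.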
Positive Logits
Token       Logit
cou         0.18
moth        0.16
_sem        0.16
Sem         0.16
cou         0.15
Whole       0.15
solid       0.15
wed         0.15
Whole       0.15
nite        0.15
Activations Density 0.198%