INDEX
Explanations
phrases indicating judgment or opinion regarding subjects
New Auto-Interp
Negative Logits
omore
-0.14
ÙĬÙĥÙĪÙĨ
-0.14
uki
-0.13
eldom
-0.13
hi
-0.13
Miss
-0.13
èά
-0.12
æĢģ
-0.12
caps
-0.12
(s
-0.12
POSITIVE LOGITS
sik
0.15
angel
0.15
Jeg
0.14
Cop
0.14
ìļ´ëĵľ
0.14
/**č↵
0.14
ulk
0.14
ì¢Į
0.13
HasKey
0.13
Aligned
0.13
Activations Density 0.461%