INDEX
Explanations
inquiries or questions directed at the self or audience
New Auto-Interp
Negative Logits
opis
-0.17
kup
-0.15
sko
-0.14
ìĿ´ìķ¼
-0.14
kal
-0.13
бÑĥдÑĮ
-0.13
ategories
-0.13
loat
-0.13
Hammond
-0.13
isay
-0.13
POSITIVE LOGITS
:
0.32
Does
0.23
Where
0.23
How
0.23
What
0.22
Why
0.21
:Is
0.21
Who
0.20
whether
0.20
Whether
0.19
Activations Density 0.067%