INDEX
Explanations
sentences containing questions or inquiries
New Auto-Interp
Negative Logits
leck
-0.17
shire
-0.16
igkeit
-0.15
/fixtures
-0.15
AGON
-0.14
gie
-0.14
-ul
-0.14
.tell
-0.14
sak
-0.14
reu
-0.14
POSITIVE LOGITS
naires
0.26
naire
0.21
stell
0.16
pare
0.16
stown
0.15
ccione
0.14
lycer
0.14
stellung
0.14
arrow
0.14
eger
0.14
Activations Density 0.041%