INDEX
Explanations
instances of humor and light-hearted commentary in the text
New Auto-Interp
Negative Logits
imeline
-0.15
ÄĽÅ¾
-0.14
اÙĨÙĩ
-0.14
&o
-0.14
eÄį
-0.14
strup
-0.13
inex
-0.13
aneous
-0.13
że
-0.13
izioni
-0.13
POSITIVE LOGITS
CLR
0.14
egl
0.14
ên
0.14
Ú
0.14
pher
0.14
lor
0.14
Nep
0.14
ð
0.14
ark
0.14
634
0.14
Activations Density 0.425%