INDEX
Explanations
references to humor and comedic elements in the text
New Auto-Interp
Negative Logits
ormal
-0.17
rost
-0.16
heid
-0.15
adt
-0.15
bai
-0.15
ÑĨип
-0.14
Bers
-0.14
ouz
-0.14
umer
-0.14
Enlarge
-0.14
POSITIVE LOGITS
ifest
0.18
rung
0.16
CFO
0.16
fffffff
0.16
oft
0.14
prit
0.14
ÃŃch
0.14
Rodney
0.14
sapi
0.14
Jarvis
0.14
Activations Density 0.013%