Explanations
references to jokes and pranks in various contexts
pranks and jokes
Negative Logits
AndEndTag    -0.46
rungsseite   -0.43
guarantees   -0.32
treatment    -0.32
枚目         -0.31
loadModel    -0.31
SafeMath     -0.30
model        -0.30
owner        -0.30
delwed       -0.30
Positive Logits
prank          0.79
pranks         0.68
hoax           0.66
ब्रेकडाउन       0.59
prilis         0.56
joke           0.53
richTextPanel  0.51
ocular         0.50
Downvote       0.49
humour         0.49
Activations Density 0.018%