INDEX
Explanations
forms of wordplay, particularly puns
instances of wordplay focused on puns
New Auto-Interp
Negative Logits
Consent
-0.71
Empires
-0.70
Commons
-0.69
STD
-0.65
Biological
-0.65
Journals
-0.64
Refuge
-0.63
Issues
-0.61
Coh
-0.61
Divinity
-0.61
POSITIVE LOGITS
isher
1.08
ishers
1.04
pun
1.03
pun
1.00
cheon
0.93
itive
0.90
ting
0.86
oons
0.85
itial
0.84
hett
0.82
Activations Density 0.005%