Explanations
words associated with sneezing or similar-sounding actions
Negative Logits
ri       -0.19
æĿľ      -0.17
ro       -0.17
929      -0.17
mach     -0.17
rim      -0.16
marsh    -0.16
sh       -0.16
ree      -0.16
ustin    -0.16
Positive Logits
aks      0.26
eps      0.21
aking    0.21
eer      0.20
eding    0.20
akers    0.20
ez       0.19
eds      0.19
eks      0.19
ats      0.19
Activations Density: 0.110%