INDEX
Explanations
names of people or characters
elements related to humor and comedy
New Auto-Interp
Negative Logits
UNCLASSIFIED
-0.49
aucas
-0.48
`
-0.48
acl
-0.47
egu
-0.45
Islamic
-0.45
Af
-0.44
ÃŃs
-0.44
Pesh
-0.43
miss
-0.43
POSITIVE LOGITS
dudes
0.70
hilar
0.65
nerds
0.65
dude
0.62
Tumblr
0.60
Tumblr
0.59
Nerd
0.59
nerd
0.57
Craigslist
0.56
trolling
0.56
Activations Density 2.760%