INDEX
Explanations
mentions of Twitter usernames
sequences of underscores or placeholders for usernames or tags
New Auto-Interp
Negative Logits
screenings
-0.83
reception
-0.81
aud
-0.77
screening
-0.71
chlorine
-0.71
manned
-0.70
divers
-0.69
bounce
-0.69
ric
-0.68
Manson
-0.68
POSITIVE LOGITS
ebook
1.32
chance
1.26
EStreamFrame
1.08
vs
1.08
dust
1.04
429
1.04
RAW
1.04
must
1.02
SOURCE
1.00
deck
1.00
Activations Density 0.020%