INDEX
Explanations
mentions of the word "emails"
mentions of emails
Negative Logits
iasis        -0.75
axis         -0.74
stood        -0.70
Glasgow      -0.68
GBT          -0.68
OV           -0.67
mbuds        -0.67
vic          -0.67
hood         -0.67
Edinburgh    -0.66
Positive Logits
Emails            0.94
inbox             0.91
correspondence    0.88
dumps             0.88
ileaks            0.88
exchanged         0.85
emails            0.85
messages          0.82
newsletters       0.82
archive           0.81
Activations Density: 0.022%