INDEX
Explanations
words or phrases associated with invalid emails
references to controversial or criminal activities
New Auto-Interp
Negative Logits
favoring
-0.86
recognizing
-0.86
cius
-0.78
favors
-0.78
calibrated
-0.77
iations
-0.76
normalized
-0.76
constituted
-0.75
stabilize
-0.75
recognized
-0.75
POSITIVE LOGITS
Liverpool
1.01
Notting
1.01
Ukip
1.00
Newcastle
0.98
DUP
0.98
Nicola
0.95
Tube
0.95
Labour
0.95
GET
0.93
Scotland
0.92
Activations Density 0.217%