INDEX
Negative Logits
distraction
-0.69
distracting
-0.67
intel
-0.66
Dickinson
-0.65
CIS
-0.61
disruptive
-0.61
menace
-0.61
Nunes
-0.61
heric
-0.61
advers
-0.60
POSITIVE LOGITS
Mart
1.12
street
0.93
Street
0.88
school
0.82
mart
0.81
School
0.81
Baltimore
0.79
Wal
0.77
Ele
0.76
Ber
0.76
Activations Density 0.006%