INDEX
Explanations
words or phrases related to negative health or medical implications
New Auto-Interp
Negative Logits
Tennessee
-0.18
ÄĮR
-0.17
cialis
-0.17
VR
-0.16
VA
-0.16
Arizona
-0.15
TN
-0.15
Sloven
-0.15
VA
-0.15
Slovenia
-0.15
POSITIVE LOGITS
Johannesburg
0.19
Istanbul
0.19
the
0.19
Jakarta
0.18
Nairobi
0.18
.Metro
0.17
Kolkata
0.17
İstanbul
0.17
Lexington
0.16
Tokyo
0.16
Activations Density 0.023%