INDEX
Explanations
references to a specific retail store chain
mentions of a specific retail brand
New Auto-Interp
Negative Logits
Tayyip
-0.72
deaf
-0.67
UTH
-0.67
icter
-0.65
untreated
-0.64
fertile
-0.63
HEAD
-0.62
polarization
-0.62
pregnant
-0.61
Anchorage
-0.61
POSITIVE LOGITS
mart
1.18
inez
0.97
ards
0.96
Mart
0.95
ukong
0.93
inho
0.91
Stores
0.87
ertodd
0.87
inet
0.87
lain
0.86
Activations Density 0.006%