Explanations
mentions of sponsors or sponsored content
terms related to sponsorship
Negative Logits
Token        Logit
selves       -0.77
pend         -0.75
otom         -0.72
izabeth      -0.71
itri         -0.69
cale         -0.68
msec         -0.67
asure        -0.67
olit         -0.67
juries       -0.66

Positive Logits
Token        Logit
hips          1.00
sponsor       0.95
orship        0.94
sponsorship   0.91
sponsors      0.90
sponsoring    0.89
sponsored     0.87
Spons         0.85
sponsored     0.76
Spons         0.74
Activations Density: 0.024%
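
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes that activation density means the fraction of sampled tokens on which the feature's activation exceeds a threshold; the page does not state its definition, and the function name `activation_density` and the `threshold` parameter are hypothetical.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    feature fires at all. The source page does not define the term.
    """
    if len(activations) == 0:
        raise ValueError("activations must not be empty")
    # Count the tokens on which the feature is active.
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Illustrative data only (not taken from the page):
# 24 active tokens out of 100,000 sampled tokens.
sample = [0.0] * 99_976 + [1.0] * 24
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.024% would mean the feature fires on roughly 24 of every 100,000 tokens, which fits a narrow feature tied to sponsorship vocabulary.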