INDEX
Explanations
references to specific names or titles
proper nouns, specifically names related to brands, movies, or places
New Auto-Interp
Negative Logits
";
-0.56
SPONSORED
-0.56
______
-0.54
��������
-0.52
Shutterstock
-0.51
advertisement
-0.51
)",
-0.51
behalf
-0.49
.....
-0.49
.):
-0.49
POSITIVE LOGITS
assures
0.92
has
0.92
insists
0.91
hasn
0.90
agrees
0.90
believes
0.90
knows
0.89
intends
0.88
recommends
0.88
expects
0.86
Activations Density 0.694%