INDEX
Explanations
specific brand names
mentions of the Coca-Cola brand and related companies
Negative Logits
flight    -0.69
FAA       -0.68
Ro        -0.67
HIP       -0.66
awar      -0.65
bound     -0.64
flight    -0.64
auld      -0.62
rosc      -0.60
liest     -0.60
Positive Logits
Cola      1.94
Bottle    0.95
wagen     0.94
wagon     0.92
fountain  0.89
Coke      0.87
weed      0.84
xon       0.84
Bott      0.84
convol    0.80
Activations Density: 0.008%