INDEX
Explanations
references to a specific type of food, particularly lobster and salmonella
mentions of specific people or brands
New Auto-Interp
Negative Logits
aleigh
-0.66
Fate
-0.64
DCS
-0.63
spirited
-0.61
éĹĺ
-0.60
Ùİ
-0.60
pron
-0.59
sidx
-0.59
ä¸Ń
-0.59
ailing
-0.58
POSITIVE LOGITS
Lob
1.22
otomy
0.96
Lon
0.96
cin
0.91
ovie
0.91
otom
0.91
opa
0.86
omon
0.85
ovy
0.85
oS
0.84
Activations Density 0.012%