INDEX
Explanations
mentions of specific locations or names
repeated instances of specific names and terms, particularly those related to named entities or brands
Negative Logits
IFF      -0.88
DPR      -0.76
esp      -0.69
ASA      -0.65
crop     -0.64
pter     -0.64
iffs     -0.63
jams     -0.63
iff      -0.62
crust    -0.62
Positive Logits
alle     2.33
Nug      1.47
Nu       1.47
ÙĦ       1.36
Doe      1.31
uana     1.22
Lu       1.09
Louie    1.04
oleon    1.04
oxy      1.02
Activations Density 0.049%
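
The density figure above can be read as the share of token positions on which the feature fires. The page does not state its exact definition, so the sketch below rests on an assumption: density is the number of positions with activation above a threshold (zero here) divided by the total number of positions. The function name `activation_density` and the sample data are hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (number of active positions) / (total positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 5 of which activate the feature.
sample = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.7, 2.0]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.050%
```

Under this reading, a density of 0.049% would mean the feature is active on roughly 5 out of every 10,000 tokens, which fits a sparse feature tied to specific names.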