INDEX
Explanations
references to new or fresh items or entities
mentions of brands or brand-related terms
New Auto-Interp
Negative Logits
ulhu
-0.94
pmwiki
-0.87
veyard
-0.80
Poverty
-0.73
abama
-0.72
SEE
-0.71
cale
-0.69
poons
-0.69
Able
-0.69
vae
-0.69
POSITIVE LOGITS
ishing
0.98
loyalty
0.96
enburg
0.87
ished
0.86
brand
0.82
ages
0.79
Brand
0.77
aging
0.77
ishes
0.77
brand
0.72
Activations Density 0.016%