INDEX
Explanations
references to the fashion magazine "Vogue" and related terms
references to specific cultural or artistic topics related to places and individuals
New Auto-Interp
Negative Logits
rogen
-0.87
raz
-0.78
ework
-0.77
anas
-0.76
eworks
-0.72
ulously
-0.69
ologies
-0.67
inet
-0.67
ulas
-0.67
penter
-0.67
POSITIVE LOGITS
Wilde
0.92
NX
0.76
tsky
0.72
phal
0.71
Beach
0.69
é»Ĵ
0.68
velt
0.67
ãħĭ
0.67
Leone
0.65
ppe
0.65
Activations Density 0.089%