INDEX
Explanations
common surnames or proper nouns
proper nouns, particularly names and titles
New Auto-Interp
Negative Logits
BI
-0.70
infographic
-0.70
Graphic
-0.70
Sovereign
-0.69
enthusi
-0.68
FORM
-0.66
wholesale
-0.66
pione
-0.65
Uni
-0.65
RIS
-0.64
POSITIVE LOGITS
obar
0.91
rette
0.84
anyahu
0.82
vals
0.81
ady
0.81
rano
0.81
orius
0.80
usky
0.79
foo
0.79
ison
0.78
Activations Density 0.164%