INDEX
Explanations
mentions of the word "Bra" with varying levels of specificity and relevance
occurrences of the term "Bra."
New Auto-Interp
Negative Logits
ership
-0.81
matic
-0.79
regor
-0.75
ancial
-0.75
eers
-0.72
ARY
-0.72
omez
-0.71
Gutenberg
-0.69
ovych
-0.69
TY
-0.69
POSITIVE LOGITS
ille
1.04
kes
0.92
hma
0.92
ided
0.85
Bra
0.84
ves
0.84
keley
0.84
illard
0.82
zz
0.81
very
0.81
Activations Density 0.008%