INDEX
Explanations
the name "Rosa" with varying intensities
mentions of the name "Rosa" and other related names
New Auto-Interp
Negative Logits
ype
-0.88
ramer
-0.88
eers
-0.85
eer
-0.83
een
-0.78
redit
-0.77
teen
-0.76
rane
-0.76
urated
-0.74
liest
-0.72
POSITIVE LOGITS
Luxem
1.28
Parks
0.94
Mata
0.87
Rosa
0.80
Osw
0.77
issance
0.77
quez
0.76
Luxembourg
0.74
hea
0.72
Ramirez
0.70
Activations Density 0.017%