INDEX
Explanations
references to "du" followed by another word
the repeated phrase "du" indicating a focus on specific cultural or artistic references
New Auto-Interp
Negative Logits
IO
-0.79
LER
-0.72
ULE
-0.71
ANS
-0.70
IAL
-0.70
ATTLE
-0.69
acco
-0.69
mable
-0.68
nor
-0.68
undle
-0.68
POSITIVE LOGITS
jour
0.88
masse
0.73
dé
0.70
gou
0.69
bone
0.68
lieu
0.68
ction
0.68
ught
0.67
ivery
0.65
wool
0.65
Activations Density 0.017%