Explanations
proper nouns or names related to brands, companies, or products
the term "original" in various contexts
Negative Logits
aiman     -0.76
wal       -0.74
rooms     -0.72
ucket     -0.71
attled    -0.70
uls       -0.68
washer    -0.68
raq       -0.68
avers     -0.68
ctrl      -0.66
Positive Logits
Flavoring  0.96
Original   0.91
Poster     0.75
Version    0.74
ity        0.74
Original   0.73
Orig       0.71
Creator    0.69
Score      0.69
Trailer    0.69
Activations Density: 0.013%