INDEX
Explanations
references to popular culture
references to pop culture and its various manifestations
New Auto-Interp
Negative Logits
Vera
-0.77
humane
-0.64
heart
-0.64
Liv
-0.64
Sheep
-0.62
Tus
-0.61
Hus
-0.61
¥
-0.61
Ric
-0.60
Georg
-0.60
POSITIVE LOGITS
depictions
0.83
outlets
0.83
trivia
0.81
studios
0.81
culture
0.80
advertising
0.80
references
0.80
appropriation
0.77
References
0.77
satire
0.75
Activations Density 0.048%