INDEX
Explanations
textual artifacts
phrases related to cultural dynamics or appropriation
New Auto-Interp
Negative Logits
booster
-0.70
consolidation
-0.69
horizont
-0.67
Minotaur
-0.66
juggling
-0.66
assetsadobe
-0.66
monop
-0.66
lain
-0.66
mattress
-0.65
planner
-0.65
POSITIVE LOGITS
Â
1.50
»
1.33
±
1.33
«
1.32
£
1.30
¢
1.25
\/
1.24
´
1.23
^
1.22
°
1.15
Activations Density 0.086%