INDEX
Explanations
words related to a specific name or place, "Söder.”
variations of a specific character or symbol within the text
New Auto-Interp
Negative Logits
entirety
-0.77
xual
-0.72
Drawn
-0.64
Hercules
-0.63
riott
-0.62
aneously
-0.61
enged
-0.61
Gemini
-0.61
aneous
-0.60
Sasuke
-0.59
POSITIVE LOGITS
ö
1.20
·
1.11
¶
1.02
ä
0.96
¬
0.95
ön
0.95
zbek
0.92
¸
0.92
Ķ
0.92
Ã¥
0.92
Activations Density 0.006%