Explanations
the emphasis on the word "so" and its varying contexts
Negative Logits
μία            -0.76
indépendance   -0.75
vertes         -0.73
Breton         -0.71
Winfrey        -0.71
hond           -0.70
Portail        -0.70
wur            -0.70
Catania        -0.70
Brugge         -0.69
Positive Logits
so             1.71
So             1.53
So             1.46
SO             1.39
so             1.32
Sooo           1.19
Så             1.12
SO             1.08
sooo           1.04
sooooo         1.03
Activations Density: 0.101%