INDEX
Explanations
personal possessive pronouns followed by specific nouns
plural forms of the letter 's' in different contexts
New Auto-Interp
Negative Logits
Hasan
-0.72
Presidents
-0.72
indemn
-0.71
boycot
-0.67
Grateful
-0.62
Suns
-0.61
Islamic
-0.60
Republican
-0.59
Ping
-0.59
Sinn
-0.58
POSITIVE LOGITS
pecially
1.14
lightly
1.13
atisf
1.11
uddenly
1.05
ELF
1.05
ustainable
1.02
ources
0.99
omew
0.99
ustain
0.98
̶
0.94
Activations Density 0.303%