INDEX
Explanations
instances of the word "Mandarin."
references to the Mandarin language and discussions about accents
New Auto-Interp
Negative Logits
apego
-0.86
tics
-0.80
dies
-0.80
tical
-0.79
bs
-0.77
izons
-0.76
gs
-0.76
=-=-=-=-=-=-=-=-
-0.75
Higgins
-0.75
ds
-0.74
POSITIVE LOGITS
Mandarin
1.02
accents
0.94
pronunciation
0.89
dialect
0.83
transcription
0.82
accent
0.80
artic
0.76
poisoning
0.75
pron
0.74
rained
0.72
Activations Density 0.017%