Explanations
text snippets related to spelling words out
references to the act of spelling
Negative Logits
Token       Logit
romy        -0.76
elaide      -0.75
ribut       -0.74
pheus       -0.74
ivals       -0.73
Kin         -0.71
Reviewer    -0.70
RESULTS     -0.68
Greenwald   -0.67
Belg        -0.67
Positive Logits
Token           Logit
spelling        1.27
spelled         1.12
spell           0.89
bee             0.89
phon            0.84
abbrevi         0.83
shortened       0.80
pronunciation   0.79
deaf            0.78
casting         0.78
Activations Density: 0.014%