INDEX
Explanations
punctuation marks indicating emphasis or dramatic effect
New Auto-Interp
Negative Logits
Mob
-0.71
Paso
-0.70
ciples
-0.67
Slug
-0.66
Dragons
-0.62
Elys
-0.62
Folder
-0.59
Panda
-0.59
Stard
-0.59
Gard
-0.58
POSITIVE LOGITS
gasp
0.92
perhaps
0.91
albeit
0.89
again
0.88
––
0.86
almost
0.83
conserv
0.78
quite
0.77
surprise
0.76
importantly
0.76
Activations Density 0.100%