INDEX
Explanations
Seems like neuron 4 is activating for misspelled variations of the word "somewhat"
occurrences of the word "somewhat" and related variations
New Auto-Interp
Negative Logits
gio
-0.73
Prosecut
-0.72
holm
-0.71
Kh
-0.71
Feld
-0.68
Reviewer
-0.67
Luxembourg
-0.65
xual
-0.65
cv
-0.63
selection
-0.62
POSITIVE LOGITS
ety
1.08
ently
1.02
ency
1.01
hest
0.97
ards
0.97
atell
0.95
actory
0.91
ato
0.91
ences
0.90
eday
0.90
Activations Density 0.028%