INDEX
Explanations
details related to educational institutions and their activities.
The neuron spikes on the very first word of a new paragraph or section (i.e. the token immediately following a blank‐line break).
New Auto-Interp
Negative Logits
ными
-0.07
چند
-0.07
ования
-0.06
γ
-0.06
ions
-0.06
goodness
-0.06
"*",
-0.06
ACY
-0.06
енного
-0.06
retarded
-0.06
POSITIVE LOGITS
.↵
0.07
.’↵↵
0.07
.Disclaimer
0.07
).↵
0.07
arest
0.07
%.↵
0.07
ैं।↵
0.06
ै.↵
0.06
///↵
0.06
ें।↵
0.06
Activations Density 0.379%