INDEX
Explanations
specific high-frequency nouns and values in the text
New Auto-Interp
Negative Logits
opensource
-0.15
ydk
-0.15
¯
-0.15
ythe
-0.14
жив
-0.14
SUBSTITUTE
-0.14
ccione
-0.14
Ñĩий
-0.13
.githubusercontent
-0.13
ADVISED
-0.13
POSITIVE LOGITS
Hubbard
0.15
ateg
0.14
unta
0.14
agle
0.14
Hugh
0.14
eneg
0.14
tout
0.14
umb
0.13
wire
0.13
fram
0.13
Activations Density 0.002%