INDEX
Explanations
elements that provide insights and reflections on various topics or narratives
New Auto-Interp
Negative Logits
è¯ī
-0.16
mock
-0.15
Invent
-0.15
ption
-0.14
?>"/>↵
-0.14
åİļ
-0.14
说æĺİ
-0.14
(fullfile
-0.14
ipl
-0.13
tempts
-0.13
POSITIVE LOGITS
nug
0.35
tid
0.34
gems
0.33
observations
0.32
insights
0.30
pearls
0.29
observations
0.28
Gems
0.27
insight
0.26
pearl
0.26
Activations Density 0.209%