INDEX
Explanations
phrases questioning beliefs and exploring existential dilemmas
New Auto-Interp
Negative Logits
WithURL
-0.17
klu
-0.16
rello
-0.15
TOP
-0.15
ayas
-0.15
Nope
-0.14
.scalablytyped
-0.14
λοÏħ
-0.14
geh
-0.14
awner
-0.14
POSITIVE LOGITS
then
0.32
çļĦè¯Ŀ
0.28
then
0.27
então
0.26
Then
0.24
ÑĤогда
0.23
then
0.23
entonces
0.23
alors
0.22
Then
0.21
Activations Density 0.131%