Explanations
notable quotes and their authors
Negative Logits
Token       Logit
starting    -0.14
ÑĤÑİ        -0.13
bish        -0.13
kova        -0.13
ãĥ«ãĥķ      -0.13
iani        -0.13
ël          -0.13
enberg      -0.12
detailed    -0.12
reap        -0.12
Positive Logits
Token       Logit
quoted      0.32
quote       0.29
quote       0.28
Quote       0.28
Quote       0.28
quoted      0.26
quotes      0.26
Quotes      0.25
-quote      0.24
~           0.24
Activations Density: 0.115%