INDEX
Explanations
dialogue or conversational exchanges
Quoted dialogue or speech snippets
I say / I mean / I was
New Auto-Interp
Negative Logits
Notably
-0.80
cref
-0.76
relevantes
-0.75
impactful
-0.74
souhaitez
-0.72
Importantly
-0.72
curated
-0.72
TLDR
-0.71
Ensuring
-0.71
relevancia
-0.70
POSITIVE LOGITS
daß
1.05
muß
1.01
müßte
0.96
Daß
0.94
mußte
0.92
mußten
0.91
Schluß
0.89
biß
0.86
damned
0.83
läßt
0.80
Activations Density 0.508%