INDEX
Explanations
instances of the phrase "I think."
New Auto-Interp
Negative Logits
事に
-0.47
ouwen
-0.46
sū
-0.43
x
-0.42
vinyle
-0.42
my
-0.42
Darlington
-0.41
事を
-0.41
化
-0.41
mely
-0.41
POSITIVE LOGITS
umably
1.13
glaube
1.11
perhaps
1.04
Probably
1.02
おそらく
1.02
mutlich
1.02
probably
1.01
probablemente
1.00
probably
1.00
Probably
0.99
Activations Density 0.208%