INDEX
Explanations
specific names or terms associated with people or characters
HoJos, Ophiucus, Sosa, Vrooder
New Auto-Interp
Negative Logits
↵
-0.51
-0.40
↵↵
-0.39
s
-0.36
_
-0.34
下
-0.34
上
-0.33
-0.33
x
-0.32
-0.32
POSITIVE LOGITS
pleaſure
0.96
faſt
0.94
queſta
0.91
iſchen
0.90
itſelf
0.87
iſche
0.87
queſto
0.85
ſcher
0.85
<unused14>
0.83
<unused8>
0.83
Activations Density 0.205%