INDEX
Explanations
phrases indicating repetition or nostalgia
New Auto-Interp
Negative Logits
isÃŃ
-0.17
ãĥķãĤ§
-0.17
urum
-0.16
erring
-0.15
ondon
-0.14
Marvin
-0.14
erland
-0.14
enticator
-0.14
ozor
-0.14
oksen
-0.14
POSITIVE LOGITS
again
0.32
again
0.26
Again
0.24
repeat
0.23
Again
0.23
повÑĤоÑĢ
0.21
Repeat
0.21
novamente
0.21
lại
0.20
AGAIN
0.20
Activations Density 0.204%