INDEX
Explanations
punctuation marks and their usage
New Auto-Interp
Negative Logits
jadx
-0.18
bakan
-0.15
ipple
-0.14
ipples
-0.14
hatt
-0.14
ác
-0.14
iode
-0.14
(Of
-0.14
iminal
-0.14
frage
-0.13
POSITIVE LOGITS
talking
0.28
Talking
0.27
Talking
0.23
similarly
0.20
CAP
0.19
apart
0.19
talks
0.18
Sources
0.18
sources
0.18
Caption
0.18
Activations Density 0.008%