Explanations
punctuation marks that indicate a pause or transition in text
Negative Logits
SIL        -0.15
ÎļÏĮ       -0.14
ãĥķãĤ       -0.14
éĮ²        -0.14
paralle    -0.14
arith      -0.14
deaux      -0.13
Punk       -0.13
b          -0.13
ful        -0.13
Positive Logits
ouch          0.17
onde          0.17
onda          0.16
lesbi         0.16
>Error        0.14
åĭĻ           0.14
krv           0.14
yourselves    0.14
eli           0.14
ILES          0.14
Activations Density 0.237%