INDEX
Explanations
casual speech and colloquial expressions
New Auto-Interp
Negative Logits
jac
-0.18
ulin
-0.15
ça
-0.15
vio
-0.15
ErrorException
-0.14
stown
-0.14
yla
-0.14
ãĥ³ãĥĨ
-0.14
_Tis
-0.14
ocht
-0.14
POSITIVE LOGITS
-ing
0.39
'd
0.31
-ed
0.30
’d
0.30
ing
0.30
'
0.26
’
0.25
ing
0.24
'er
0.23
'es
0.22
Activations Density 0.102%