Explanations
mentions of "ink" in various contexts
Negative Logits
Token       Logit
dish        -0.14
illance     -0.14
emachine    -0.14
zd          -0.14
tered       -0.14
.sg         -0.14
Dish        -0.13
enger       -0.13
iani        -0.13
inal        -0.13
Positive Logits
Token       Logit
ãĥĭãĤ¢      0.17
éĭ          0.16
éru         0.16
rc          0.15
pund        0.15
pit         0.14
egment      0.14
emain       0.14
ihad        0.14
sse         0.13
Activations Density: 0.008%