INDEX
Explanations
instances of the word "Ok" and its variations in different contexts
New Auto-Interp
Negative Logits
anca
-0.17
enko
-0.16
agnar
-0.16
PLY
-0.16
fty
-0.16
addy
-0.15
ença
-0.15
ials
-0.15
IZE
-0.15
encer
-0.15
POSITIVE LOGITS
tober
0.34
lahoma
0.30
anagan
0.27
lah
0.26
ahoma
0.26
hots
0.23
ategor
0.22
amoto
0.21
asaki
0.20
ays
0.20
Activations Density 0.015%