INDEX
Explanations
references to processes involving sending, reaching, or transferring items or information
Negative Logits
orro      -0.16
loff      -0.15
erton     -0.14
jal       -0.14
eam       -0.14
lotte     -0.14
ój        -0.13
eder      -0.13
.Serve    -0.13
solete    -0.13
Positive Logits
pper      0.15
iska      0.15
into      0.15
vla       0.15
745       0.15
lava      0.15
bage      0.14
Ñīа       0.14
sat       0.14
sat       0.14
Activations Density 0.161%