INDEX
Explanations
variations of the word "participate" and its derivatives
New Auto-Interp
Negative Logits
inged
-0.18
zeÅĪ
-0.15
ingers
-0.15
tÄĽ
-0.14
halt
-0.14
bug
-0.14
änger
-0.14
liž
-0.14
isé
-0.14
eat
-0.14
POSITIVE LOGITS
ipation
0.25
les
0.23
atory
0.22
LES
0.19
ip
0.18
abra
0.17
antes
0.17
ipp
0.16
ipe
0.16
iple
0.16
Activations Density 0.006%