INDEX
Explanations
mentions of the word "parent" in various contexts
New Auto-Interp
Negative Logits
loff
-0.15
eriod
-0.15
رد
-0.14
Hol
-0.14
iban
-0.14
éĸĵ
-0.14
yer
-0.13
erk
-0.13
erte
-0.13
yan
-0.13
POSITIVE LOGITS
roupe
0.17
eral
0.16
erals
0.16
orex
0.14
.fp
0.14
gio
0.14
'gc
0.14
htable
0.14
alie
0.14
ooled
0.13
Activations Density 0.017%