INDEX
Explanations
references to the concept of "Israel" and its variations
New Auto-Interp
Negative Logits
baiser
-0.16
Banc
-0.15
andan
-0.15
Nile
-0.15
ãĤ
-0.15
ylko
-0.15
ummer
-0.14
ÙĪÙĨد
-0.14
resse
-0.14
:url
-0.14
POSITIVE LOGITS
esh
0.26
eh
0.26
ech
0.23
is
0.22
itz
0.21
ose
0.20
IVO
0.20
iz
0.20
asher
0.19
chez
0.19
Activations Density 0.006%