INDEX
Explanations
words related to medication or treatment
the presence of the token "ke" in various forms
New Auto-Interp
Negative Logits
ourced
-0.69
Palestin
-0.68
ourcing
-0.67
ohydrate
-0.64
é¾
-0.62
ashtra
-0.62
aceutical
-0.61
anamo
-0.60
MpServer
-0.60
aciously
-0.59
POSITIVE LOGITS
ller
1.34
llers
1.33
ptic
1.17
vich
1.09
lling
1.07
ls
1.03
letal
1.01
jriwal
0.95
rette
0.94
rer
0.93
Activations Density 0.027%