INDEX
Explanations
references to a specific individual named Bilal
New Auto-Interp
Negative Logits
Wunused
-0.15
ÎľÎ¿Î½
-0.15
åIJ¹
-0.15
ussy
-0.15
ubat
-0.14
emons
-0.14
egl
-0.14
unc
-0.14
obuf
-0.14
ТÐŀ
-0.14
POSITIVE LOGITS
ateral
0.25
bao
0.25
bil
0.21
Bil
0.20
gewater
0.19
bil
0.17
ibili
0.16
boa
0.16
bill
0.16
ioni
0.15
Activations Density 0.009%