INDEX
Negative Logits
illard
-0.16
ools
-0.15
Fior
-0.15
isiyle
-0.14
iner
-0.14
gart
-0.13
wi
-0.13
intel
-0.13
AdminController
-0.13
Abd
-0.13
POSITIVE LOGITS
fold
0.15
ãĥ³ãĥĩ
0.14
meni
0.14
ä¹İ
0.14
arness
0.14
immers
0.14
oufl
0.14
panion
0.13
ortal
0.13
_apply
0.13
Activations Density 0.005%