INDEX
Negative Logits
bras
-0.07
/reference
-0.07
.street
-0.07
paperwork
-0.06
fist
-0.06
-0.06
�
-0.06
(long
-0.06
facets
-0.06
Sciences
-0.06
POSITIVE LOGITS
abor
0.07
�
0.06
disdain
0.06
Unknown
0.06
compar
0.06
outu
0.06
inese
0.06
PARAM
0.06
IOR
0.06
ignored
0.06
Activations Density 0.034%