INDEX
Negative Logits
coy
-0.09
Himself
-0.08
rosa
-0.08
squad
-0.08
wille
-0.08
iode
-0.08
fle
-0.08
-0.08
прият
-0.07
-playing
-0.07
POSITIVE LOGITS
reb
0.08
uburyo
0.08
rị
0.08
reconciliation
0.08
reconcile
0.07
digits
0.07
rebbe
0.07
feasibility
0.07
parity
0.07
interplay
0.07
Activations Density 0.008%