INDEX
Negative Logits
)?
-0.78
]).
-0.73
)))
-0.72
)).
-0.70
])
-0.70
?ãĢį
-0.69
?)
-0.69
))
-0.68
attRot
-0.66
)))
-0.66
POSITIVE LOGITS
wherever
0.76
whereas
0.74
effortlessly
0.66
and
0.66
beautifully
0.65
alongside
0.64
throughout
0.64
while
0.63
lessly
0.61
faithfully
0.61
Activations Density 1.177%