INDEX
Negative Logits
llo
-0.59
Jaffe
-0.57
Ad
-0.57
あるので
-0.56
ppelt
-0.56
Parrish
-0.56
koneksi
-0.56
Bonita
-0.56
of
-0.55
ho
-0.55
POSITIVE LOGITS
</
1.66
."</
1.56
)</
1.51
"</
1.49
}</
1.39
)}</
1.32
.</
1.31
'</
1.30
!!</
1.29
?></
1.27
Activations Density 0.071%