INDEX
Negative Logits
θν               0.78
Hence            0.77
múlti            0.75
monotonically    0.75
Denote           0.74
Denote           0.73
Suppose          0.73
ẏ                0.72
üglich           0.72
_{+}             0.71
Positive Logits
scrubs           0.72
undeveloped      0.61
cheered          0.60
Teri             0.59
Canaan           0.59
dystopian        0.58
chec             0.56
libra            0.56
puppies          0.56
librarian        0.56
Activations Density 0.208%