INDEX
NEGATIVE LOGITS
τρό             0.51
interferometer  0.45
WaitingTime     0.43
caches          0.43
けれど          0.42
waivers         0.42
ヘア            0.42
ivvu            0.42
frosts          0.42
batches         0.41
POSITIVE LOGITS
appreciated     0.48
discouraged     0.47
unclear         0.45
happy           0.45
burdened        0.45
motivated       0.45
conducive       0.45
inflated        0.44
ensical         0.44
impeded         0.44
Activations Density 0.000%