NEGATIVE LOGITS
梖  0.96
欱  0.91
涫  0.88
!!!!  0.85
Presumably  0.84
uključ  0.84
lateribus  0.84
presumably  0.83
일반적으로  0.82
婜  0.81
POSITIVE LOGITS
glee  0.83
plight  0.79
untrue  0.78
tonight  0.78
sublime  0.77
gonna  0.76
spree  0.76
strife  0.75
decree  0.75
bright  0.74
Activations Density 0.774%