INDEX
Explanations
expressions of gratitude towards religious figures or deities
Negative Logits
blank       -0.16
xee         -0.15
blank       -0.14
orta        -0.14
istrator    -0.14
istra       -0.14
lain        -0.14
ะแ          -0.13
emachine    -0.13
etter       -0.13
Positive Logits
天天        0.15
Orient      0.15
enberg      0.14
Tie         0.14
kk          0.14
enis        0.14
tay         0.13
арі         0.13
inge        0.13
backs       0.13
Activations Density 0.067%