INDEX
Explanations
expressions indicating a certain sentiment or attitude towards a situation or statement
expressions indicating familiarity or mediocrity
New Auto-Interp
Negative Logits
oulos
-0.86
perty
-0.72
edia
-0.72
VIDEOS
-0.69
IBLE
-0.68
oppers
-0.64
KS
-0.62
ilitary
-0.62
å§«
-0.62
çīĪ
-0.61
POSITIVE LOGITS
este
0.74
¹
0.69
cast
0.69
nered
0.68
grapes
0.68
hearted
0.67
etter
0.65
ling
0.64
lier
0.64
assy
0.64
Activations Density 0.026%