INDEX
Explanations
expressions of joy and satisfaction
expressions of joy and pleasure
Negative Logits
enhagen        -0.83
©¶æ            -0.82
mater          -0.72
prison         -0.70
eworld         -0.68
restraining    -0.68
road           -0.68
ioxide         -0.67
vernment       -0.67
xia            -0.66
Positive Logits
fully          0.97
delight        0.91
joy            0.91
ILY            0.90
delighted      0.87
iously         0.76
ingly          0.75
ously          0.72
lance          0.72
aston          0.72
Activations Density: 0.012%