INDEX
Explanations
emotional expressions and sentiments related to love and care
New Auto-Interp
Negative Logits
813
-0.15
jan
-0.14
ally
-0.14
è±Ĩ
-0.14
olin
-0.14
.defaults
-0.14
uzzi
-0.14
bomb
-0.14
adies
-0.13
yards
-0.13
POSITIVE LOGITS
Broken
0.21
break
0.21
broken
0.20
broken
0.20
Broken
0.20
break
0.18
Break
0.18
breaking
0.18
Breaking
0.17
wrench
0.16
Activations Density 0.015%