INDEX
Explanations
expressions of strong affection
expressions of strong affection and appreciation
New Auto-Interp
Negative Logits
Accounting
-0.61
Transcript
-0.61
izable
-0.60
hybrids
-0.58
Shape
-0.57
improvised
-0.57
plur
-0.57
randomized
-0.57
TRY
-0.56
Nanto
-0.56
POSITIVE LOGITS
dearly
1.27
much
1.11
uncond
1.08
passionately
1.05
greatly
1.01
much
1.00
MUCH
0.98
badly
0.91
immensely
0.88
deeply
0.87
Activations Density 0.186%